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Chapter  I 


Introduction 


1.1.  Combinatorial  Algorithms. 

The  title  of  this  course  is  “Combinatorial  Algorithms”.  Fn  order  to  get  some  idea  of  what  this  means, 
let’s  look  at  the  two  words  in  turn  (and  remember  this  is  not  supposed  to  give  a  formal,  mathematical 
definition): 

-  to  explain  “algorithm”  it  should  actually  suffice  for  all  our  present  purposes  if  we  take  it  as  referring 
to  a  “correct  computer  program  (in,  say,  Pascal  or  Lisp)  guaranteed  to  terminate  on  all  inputs”,  or,  to 
be  less  idealistic,  on  all  allowed  inputs  (but  then  of  course  the  problem  arises  which  inputs  arc  allowed, 
and  how  do  we  distinguish  them!). 

-  the  semantics  of  “combinatorial”  is  harder  to  describe.  Classical  combinatorics  is  the  science  of  the 
properties  of  finite  collections  of  discrete  objects.  Of  course,  the  objects  might  be  anything  (like 
superdenumerable  ordinals),  but  from  a  combinational  point  of  view  we  are  only  interested  in  certain 
discrete  and  finitarily  represented  properties. 

If  that  is  all,  you  might  correctly  remark  that  the  word  “combinatorial”  in  “combinatorial  algorithms” 
is  simply  redundant  because  (digital)  computers  on  which  algorithms  supposedly  are  executed  are  definitely 
finitary  and  discrete. 

Hence  there  must  be  a  more  traditional  meaning  which  distinguishes  combinatorial  algorithms  from 
algorithms  in  general.  Let  me  try  to  put  this  necessarily  vague  notion  perhaps  as  follows: 

“Combinatorial  algorithms  are  those  dealing  with  problems  which  would  be  more 
or  less  trivial  if  one  could  inspect  all  possible  combinations  of  the  (discrete)  objects 
of  any  given  problem  instance.” 

Of  course,  this  is  a  very  indirect  explanation  of  what  a  combinatorial  algorithm  might  be,  but  it  has 
one  definite  merit  to  it:  it  highlights  the  fact  that  for  all  the  problems  we  shall  be  concerned  with  we  will 
have  to  search  for  solutions  other  and  better  than  enumerative  ones. 


1.2.  Machine  Models. 

We  now  want  to  take  a  somewhat  closer  look  at  the  computer  model  which  we  shall  have  in  mind 
in  most  of  the  cases,  or  at  least  in  the  back  of  our  mind.  The  reason  is  that  if  we  intend  to  make 
formal  (mathematical)  statements  about  computers  and  the  programs  running  on  them  we  also  have  to 
give  formal  definitions  of  what  computers  and  programs  are.  For  our  purposes,  however,  H  suffices  to 
obtain  some  rough  idea,  knowing  that  when  needed  there  would  be  a  whole  elaborated  theory  to  rely  on 
and  supply  all  the  missing  details. 

Thus,  our  machine  model  which  is  called  a  Random  Access  Machine  (RAM),  could  be  considered  as 
abstracted  from  an  (almost  real)  computer  built,  say,  at  the  beginning  of  the  sixties:  input  and  output 
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would  be  solely  via  punched  paper  tape,  program  storage  would  be  read-only  (quite  modern  again),  and 
there  would  be  just  one  register  serving  as  accumulator. 

A  more  formal  diagram  would  look  like 


read  head 


—  Diagram  of  RAM  — 

We  idealistically  assume  that  there  arc  infinitely  many  memory  cells  m0,mx, ...  ,  and  that  each  of 
them  can  hold  an  arbitrary  (signed)  integer.  (It  seems  necessary  to  remark  here  that  more  and  more  often 
these  two  assumptions  do  appear  as  idealistic;  certainly  it  is  possible  to  discard  the  second  assumption  (as 
even  the  definitions  of  some  new  programming  languages  do)  and  simulate  arbitrary  size  integers  without 
(at  least  theoretically)  undue  loss  of  efficiency,  but  also  disposing  of  the  first  would  just  leave  us  with 
finite  automata).  The  program  or  finite  state  control  of  a  RAM  very  much  looks  like  assembly  language 
with  one  address  instructions  allowing: 

-  direct,  immediate,  and  indirect  addressing  for  memory  transfer  and  the  basic  arithmetic  and  logical 
operations, 

■  conditional  and  unconditional  branching. 
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-  input  and  output,  and  one  or  a  few  control  statements  (like  STOP). 

We  do  not  specify  any  more  details  because  we  expect  that  we  will  never  have  to  use  this  rather 
awkward  machine  language,  and  assume  that  instead  have  a  very  sophisticated  compiler  from  a  high  level 
AlgoI60-  or  Pascal-like  language  which  even  allows  us  some  statements  in  natural  English.  This  will  make 
the  presentation  of  algorithms  closer  to  our  way  of  thinking  and  (hopefully)  easier  to  understand.  But 
when  analyzing  the  execution  of  such  programs  on  a  RAM  we  always  have  to  think  of  the  compiled 
version  as  being  executed. 


U.  Complexity  Measures. 

Now  that  we  have  presented  the  basic  features  of  our  formal  model  we  want  to  sec  how  to  use  it. 
Well,  we  would  like  to  employ  algorithms  and  computers  in  order  to  get  solutions  for  problems  in  which 
we  are  interested  and  from  which  we  expect  a  profit  in  some  sense.  Pursuing  this  economical  setup  a 
little  bit  further,  computers  are  scarce  resources,  and  we  incur  some  costs  using  them.  And,  naturally,  we 
are  interested  in  minimizing  these  costs. 

As  in  economics,  we  have  to  clarify  two  questions  first: 

(1)  how  is  the  cost  of  running  a  specific  algorithm  on  a  specific  (model  of  a)  computer  defined,  '.e.,  how 
do  wc  measure  cost? 

(2)  given  two  solutions  to  a  problem,  how  do  we  compare  their  respective  costs? 

1.3.1.  Complexity  Functions. 

Most  of  the  algorithms  we  shall  be  looking  at  are  able  to  solve  not  only  one  instance  of  a  problem 
(say,  determining  whether  the  lO.O^’st  digit  of  the  decimal  expansion  of  7r  is  5),  but  normally  (at  least  on 
our  idealized  computer  model,  the  RAM)  an  infinite  number  of  such  instances  (in  the  above  example: 
finding  out  whether  the  n-th  digit  of  the  decimal  expansion  of  n  is  5,  for  any  positive  integer  n  given 
to  the  algorithm).  And  actually  it  is  just  such  an  infinite  collection  of  instances  (given  by  means  of  some 
general  parameterization  of  by  a  defining  common  property)  what  wc  mean  by  a  problem. 

In  order  to  be  able  to  work  with  our  formal  computer  model  we  also  have  to  find  a  formal  definition 
of  a  problem.  Here,  an  instance  of  a  problem  would  be  given  by  some  string,  i.e.,  finite  sequence  of 
characters  over  some  fixed,  finite  alphabet  “known”  to  the  computer,  which  for  us  somehow  describes 
the  problem  instance  we  have  in  mind.  As  the  execution  of  an  algorithm,  given  some  string  as  input,  is 
independent  of  what  meaning  we  attach  to  the  string,  wc  arrive  at  the  following  formal 

Definition: 

(a)  A  problem  is  a  subset  L  of  the  set  E*  of  strings  over  some  (fixed,  finite)  alphabet  E. 

(b)  An  algorithm  A  (on  a  RAM)  recognizes  the  problem  L  if,  for  any  x  6  E*.  on  input  x  f  the  execution 
of  A  eventually  stops  and  the  output  is 

“yes”,  if  x  €  L, 

“no”,,  otherwise. 

(c)  We  say  that  A  accepts  L  if,  for  any  x  €  E*,  on  input  x  the  execution  of  A  eventually  stops  with 
output  “yes”  if  and  only  if  x  €  L. 

(So  we  do  not  care  what  A  docs  on  any  input  ytjJL  except  that  it  must  not  stop  with  output  “yes”.) 
Returning  to  our  trivial  example  above,  we  might  not  be  interested  only  whether  the  n-th  digit  of 
x  is  S,  but  instead  would  like,  on  input  n,  to  obtain  the  first  n  decimal  digits  of  x.  Here  we  are  not 
dealing  with  a  recognition  problem  with  a  simple  yes/no  answer,  but  with  the  more  general  problem  of 
computing  a  Junction,  say  from  E*  to  E*: 

*We  mme  by  convention  that  £  it  such  that  A  is  able  to  determine  the  end  of  input  on  its  input  tape. 
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Definition: 

Let  /:£*—>£*  be  a  (in  general  partial)  function.  An  algorithm  A  (on  a  RAM)  computes  f  if,  for 
any  i  6  £*,  on  input  x  the  execution  of  A  eventually  stops  if  and  only  if  f(x)  is  defined  if  and  only  if 
A  stops  with  output  f(x). 

For  a  given  problem  L  or  function  /,  there  naturally  are  —  as  in  real  world  economics  —  many 
different  “costs”  or,  as  we  call  them,  complexities ,  and  it  always  depends  on  the  circumstances  which 
ones  are  relevant  or  interesting.  Some  are  static  in  the  sense  that  they  do  not  depend  on  the  problem 
instances,  e.g. 

-  the  minimal  size  (=  number  of  instructions)  of  an  algorithm  which  recognizes  L  or  computes  /; 

-  the  minimal  number  of  branching  instructions  in  an  algorithm  for  L  or  /. 

Other  complexity  measures  are  dynamic  in  that  they  do  depend  on  problem  instances,  i.e.,  the  input  The 
two  most  important  kinds  of  dependencies  are 

-  worst-case  complexity,  where,  for  any  n  G  N,  the  maximum  cost  for  any  x  of  length  n  is  taken  with 

(a)  x  G  £*  for  a  recognition  problem, 

(b)  x  €  L  for  an  acceptance  problem,  and 

(c)  *  G  dom/  for  computing  a  function  /. 

-  expected  or  average  complexity,  where,  for  some  ’size’  function  and  probability  distributions  over  all 
inputs  of  the  same  ’size’,  the  average  cost  (according  to  the  corresponding  probability  distribution)  over 
all  inputs  of  each  ’size’  is  taken. 

We  shall  mostly  emphasize  worst-case  complexity  because  it  is  universal  (with  respect  to  the  underlying 
model)  and  does  not  depend  on  the  choice  of  more  or  less  arbitrary  probability  distributions. 

The  two  most  important  types  of  costs  or  complexity  which  we  shall  be  considering  are  those 
concerned  with  (computation)  time  and  (storage)  space. 

The  time  used  in  the  execution  of  an  algorithm  A  on  some  input  x  G  £*  is  the  sum  of  the  time 
spans  for  all  executions  of  the  basic  instructions  in  the  algorithm. 

The  space  used  in  the  execution  of  A  on  input  x  is  the  sum  of  the  ’spaces’  occupied  by  the  memory 
cells  referenced  during  the  execution  of  A. 

To  complete  the  definition  of  time  and  space  complexity  of  an  algorithm,  it  remains  to  state  the  time 
required  to  execute  each  RAM  instruction  and  the  space  used  by  a  memory  cell.  Again  there  are  two 
commonly  used  ways  to  define  these  costs.  The  first  is  called  the  uniform  cost  criterion.  It  attributes 
one  unit  of  cost  (time  or  space)  to  each  execution  of  a  basic  RAM  instruction  and  to  each  memory  cell, 
regardless  of  its  contents.  Unless  otherwise  mentioned  we  shall  refer  to  this  cost  criterion.  It  is  appropriate 
as  long  as  a  real  world  computer  can  simulate  the  RAM  algorithm  using  only  some  fixed  amount  of  time 
and  storage  words  per  basic  RAM  instruction  and  memory  cell.  In  principle  this  simply  means  that  the 
numbers  stored  in  the  RAM  memory  cells  must  not  become  too  big.  If  this  should  happen,  a  somewhat 
more  realistic  approach  might  be  the  so-called  logarithmic  cost  criterion  which  defines  the  space  used  by  a 
memory  cell  to  be  the  length  of  die  binary  representation  of  the  biggest  (in  absolute  value)  number  stored 
in  the  memory  cell  during  the  execution  of  the  algorithm,  and  which  assigns  to  each  basic  instruction  the 
sum  of  these  lengths  of  all  its  operands  as  the  time  span  used.  A  detailed  example  of  logarithmic  costs  for 
a  RAM  instruction  set  can  be  found  in  (AHU74J..  We  remark  that  the  time  and/or  space  complexity  of 
some  algorithm  may  be  widely  different  depending  on  whether  the  uniform  or  logarithmic  cost  criterion 
is  employed. 

1.3.2.  Asymptotic  Complexity. 

We  now  address  the  problem  of  comparing  complexity  functions.  Assume  that  for  a  certain  problem 
we  have  two  algorithms,  one  with  complexity  f(n)  =  n*-8‘  logn,  the  other  with  complexity  g(n)  = 
n,/logn  f.  Though  for  small  n  the  second  algorithm  has  a  smaller  complexity,  there  is  a  break  point 


'lot  n  denotes  the  real-valued  solution  to  2*  »  n. 
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n„  (r»0  s=s  39)  such  that  for  n  >  n0  the  situation  is  reversed  (if  the  complexity  considered  is  worst-case 
complexity  this  docs  not  mean  that  for  all  inputs  of  size  at  least  n0  the  first  algorithm  has  a  lower 
complexity  that  the  second;  there  must  be  however  infinitely  many  such  instances). 

Most  of  the  times  we  are  interested,  in  comparing  two  algorithms  when  applied  to  problem  instances 
of  ever  bigger  size.  The  complexity  of  an  algorithm  for  the  input  size  tending  to  infinity  is  called  its 
asymptotic  complexity,  and  using  it  as  criterion  we  might  therefore  say  that  the  first  algorithm  above  has 
a  lower  complexity  than  the  second  (note  however  that  in  practical  life  the  situation  is  not  as  simple, 
and  that  for  a  given  problem  instance  the  selection  of  the  better  algorithm  always  depends  on  the  break 
point). 

For  the  comparison  of  complexity  functions  in  the  asymptotic  sense,  we  use  the  following  notation: 
Definition: 

Let  /,  g  :  N  -»  N  be  two  functions;  we  say  that 

(a)  /  =  0(g)  if  3c  >  0  such  that  /  <  eg: 

(b)  /  =  »(<;)  ifVc  >  0  /  <„.e.  eg 

(/  <  a.e.  <7  means  f(n)  <  g(n)  for  all  but  finitely  many  n); 

(c)  /  =  n(gr)  if  3c  >  0  such  that  /  >a  t  eg: 

(d)  /  =  0(ff)  if  /  =  0(g)  and  /.=  n(9). 

(See  [Knu76J.) 

1.4.  Reduction  and  Recurrences. 

When  designing  an  algorithm  for  some  problem  it  often  happens  that  we  can  derive  a  solution  for  any 
instance  with,  say,  relatively  little  cost  if  we  only  have  the  solutions  to  a  few  other,  smaller  instances  of  the 
same  problem  or  of  instances  of  some  other  problem  which  we  know  how  to  solve.  In  the  second  case  it 
is  immediate  to  determine  the  complexity  of  an  algorithm  for  the  first  problem  which  uses  this  reduction, 
from  the  known  complexity  of  the  second  algorithm  and  the  additional  cost  for  the  reduction.  When  we 
wish  to  determine  the  complexity  of  an  algorithm  constructed  according  to  the  first  case,  however,  we 
usually  end  up  with  a  relation  of  the  following  form  for  the  complexity  C  of  our  algorithm; 

C(n)  =  F,(C(«“>, . . . ,  C(nW))  +  /<*(»), 

where  Ft  describes  the  dependency  on  the  complexities  of  the  smaller  problem  instances  of  size 
n(1), . . . ,  n(,),  and  represents  the  additional  cost  incurred  to  form  a  solution  for  the  original  problem 
instance  of  size  n.  We  shall  now  discuss  some  techniques  for  solving  such  recurrence  relations,  i.e.,  finding 
closed-form  expressions  for  the  function  C  when  the  function  F,  is  sufficiently  simple.  Nonetheless, 
these  methods  provide  a  systematic  way  to  deal  with  a  fairly  large  class  of  recurrence  equations  (also 
sec  (Lue80J).  The  techniques  are,  in  turn,  multipliers,  characteristic  polynomials,  generating  junctions,  and 
domain  and  range  transformations.  Wc  shall  demonstrate  each  technique  by  means  of  an  example  where, 
for  notational  convenience,  we  write  /„  for  f(n). 

1.4.1.  Multipliers 

Suppose  that  the  following  recurrence  relation  is  given: 

/»  =2/„_l  +  n;  for  n  >  2, 

/.=  l; 

That  is 


/»  —  2/„_1  +  n; 


/I 
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2/n-a  +  (n  -  1); 

/•  2 

2/n-j  +  (n  -  2); 

/•4 

2/„_ -f-  (n  -  a)» 

/  ‘  2* 

2/i  +  2; 

/•2" 

If  we  multiply  each  equation  with  the  multiplier  indicated  to  its  right,  then  sum  up  all  equations  and 
cancel  identical  terms  on  both  sides,  we  obtain 

/„  =2n-7«  +  £(»  -  *)2‘ 

»=0 

n 

=  -  i)2*  because  fx  =s  1 

i=0 
n—  1  t 

=  £  £2*  =  2n+l— n  — 2. 

*=o  y=o 

The  last  line  follows  by  elementary  transformations  (also  see  1.4.3.).  Hence  we  obtain 

/„  =  2n+1  -  n  -  2. 


1.4.2.  Characteristic  Polynomials. 

As  an  example,  suppose  we  are  given 


/»  — /n— 1  "+■  fn—ti 

/i=l,  /o=0. 


for  n  >  2 


Such  a  recurrence  is  called  a  homogeneous  linear  recurrence  with  constant  coefficients. 

If  we  try  to  set  /„  ==  an  for  some  o  still  to  be  determined  we  gel  ihe  following  characteristic  equation 
for  a  (assuming  that  a  ^  0): 


a*  —  a  —  1  =  0,  and  hence  ai>2  =  **■. 

« 

It  is  clear  that  every  linear  combination 

cjoJ  +  c*aj 

is  then  a  solution  to  the  recurrence  relation  /„  -  /B_t  -  /„_2  =  0  for  n  >  2.  The  boundary  conditions 
for  /0  and  /(  determine  specific  values  for  Cj  and  c2,  in  our  example  cj  =  1/n/5  =  — c2,  and  hence 


--Aim-mi 


The  question  is  whether  by  this  method  we  can  obtain  all  solutions  to  recurrences  like  the  one  above.  It 
is  not  hard  to  sec  that  this  is  true  if  all  roots  of  the  characteristic  polynomial  arc  distinct.  For  the  general 
case,  with  multiple  roots  allowed,  we  cite  tire  following  theorem  (for  a  proof  sec  [Liu68]): 


1.4.  Reduction  and  Recurrences. 
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Theorem: 

Let  p(x)  be  the  characteristic  polynomial  of  the  recurrence 
(*)  Po/n  +  Pl/n-l  +  • . .  +  Pkfn-k  =  0;  for  Tl  >  k 

with  constant  coefficients  pt.  Let  the  roots  of  p,  over  the  complex  numbers,  be  r,-,  i 
with  multiplicity  g,.  Then  all  solutions  of  (*)  are  given  by 

fn  =  E(r?  £  coni)> 

»=1  >=0 


where  the  ci3  are  arbitrary  constants. 


If  the  given  linear  recurrence  relation  (with  constant  coefficients)  is  inhomogene 
derive  a  homogeneous  recurrence  using  appropriate  multipliers,  and  then  apply  the 
above.  A  more  formal  treatment  can  be  found  in  [Lue80J. 


1.4.3.  Generating  Functions. 


For  a  sequence  (/„)„^o.  its  generating  Junction  F(z)  is  defined  as 


*■(*)  =  E  /»*"• 

ngo 

Just  using  calculus,  we  can  establish  the  following  table: 


generating  function 

n-th  element  of  the  corresponding  sequence 

cF 

c/n 

F  +  G 

/n  “i*  Qn 

FG 

!Ci= o  fi0n~i 

zkF 

if  n  <  fc  then  0  else  /„_* 

F(z)/(  1  -  z) 

E7=0  A 

**(•) 

fo'rm 

if  n  =  0  then  0  else  /„_i/n 

F(cz) 

c"/» 

Hence,  if  we  put 


F(»)  _  £  w  =  ^ 


n^O 


GW  =  E 


n£0 


we  obtain 


n.) .  cm -  u--)*,  _  2a)  -  S  I>  -  v‘*'- 


Decomposing  z/((l  -  z)*(l  -  2z))  into  partial  fractions  we  get 


z  —2  z  t  2 

(1  -  z)*(l  -  2  z)  “  I -z  "  (1  -  z)*  +  1  -  2z’ 


1  each 


ri  first  try  to 
hod  discussed 
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and  thus  the  table  above  provides  the  answer 


£(n  -  *)2'  =  2n+l  -  n  -  2. 

i=0 


1.4.4.  Domain  and  Range  Transformations. 


A  real-valued  sequence  (/„)„>o  is  a  mapping  from  N  to  the  reals.  Thus  a  transformation  on  the 
values  of  the  sequence  is  called  a  range  transformation  and  a  transformation  on  the  indices  a  domain 
transformation.  We  first  show  an  example  of  a  range  transformation.  Suppose  we  are  to  solve  the 
recurrence 

fn  =/n-l  ■  fn-t,  for  n  >  2, 

A  =2,  /o  =  I- 


If  we  let 


we  may  rewrite  the  recurrence  as 


9n  =  log/„, 


9n  =  9n-l  +  ffn-2,  for  fl  >  2, 
Si  =1.  So  =  0. 


For  this  recurrence,  we  know  the  solution  (see  1.4.2.;  g„  is  the  n-th  Fibonacci  number  F„)  and  substituting 
back  yields 

fn  =  2F*. 

A  domain  transformation  is  conveniently  used  in  the  following  example.  Let 

In  =3 /„/*  +  n,  for  n  =  2*  >  1, 
ft  =L 

Here  we  substitute  the  index  and  define  . 

9k  =  /2‘» 

from  which  we  get  the  following  secondary  recurrence  for  the  gF- 

9k  =3s*-i  +  2*,  for  *  >  1, 

So  =1> 

which  can  be  solved  with  methods  discussed  earlier.  We  obtain  gk  =  3t+1  -  2k+1,  and  therefore 


/„  =  3nlosJ  -  2n. 


This  concludes  our  discussion  of  several  very  useful  techniques  to  solve  recurrence  relations  for 
sequences.  Additional  material  can  be  found  for  instance  in  [Mil60,Rei77,Rio58,Rio68]. 


Chapter  2 


Higher  Level  Data  Structures 


2.1.  Basic  Set  Operations. 

It  is  often  useful  to  design  algorithms  in  layers.  That  is,  first  the  algorithm  is  described  in  a  high  level 
way  with  abstract  structures  that  are  described  in  a  fashion  that  is  independent  of  their  implementation. 
Then  the  implementation  details  of  the  structures  are  worked  out  or  discovered  in  the  literature. 

It  is  necessary  to  talk  about  the  specifications  rigorously.  To  do  this,  the  language  of  set  theory  is 
used.  There  are  several  operations  that  are  often  performed  on  data  structures.  These  involve  a,  ait a2, . . . 
as  data  elements  and  S,  S,,  S2, .  ■ .  as  sets  of  data  elements,  lhc  operations  arc: 

1.  Member  (a,  S):  is  a  €  S  ? 

2.  Insert  (a,  S):  S*~S  (Jo. 

3.  Delete  (a,  S)\  S*-S  —  a. 

4.  Replace  (a,  o',  5):  S*~(S  -  a)  U  a'- 

5.  Union  (5„  5>):  Si*-Sj  U  S,-. 

6.  Find  (a):  if  a  €  US.  then  i  such  that  a  £  Si  otherwise  undefined. 

If  in  addition  there  is  a  totally  ordered  universal  set  which  contains  all  possible  data  elements,  the 
following  operations  make  sense. 

7.  Min  (S):  if  S  =  0  then  undefined  else  min {6;  h  £  S}. 

8.  Max  (5):  if  5  =  0  then  undefined  else  max {6;  b  £  S}. 

9.  Split  (S,,  a,  Sjj:  if  a  £  5,  then  £,«-{&  €  5,;  b  >  a};  S(*~{b  £  5,;  b  <  a},  otherwise  undefined. 

10.  Concatenate  (Sit  Sj):  if  max(Sj)  <  min(Sj)  then  S,*— S,  U  else  undefined. 

It  is  ususally  necessary  to  be  able  to  perform  several  of  these  operations  on  the  same  sets.  Some  of 
these  combinations  of  operations  have  names  and  well  know  implementations. 


name 

supported  operations 

implementations 

dictionary 

member,  insert,  delete 

hash  table,  balanced  trees 

priority  queue 

1  insert  delete,  min 

balanced,  leftist  trees,  heap,  binomial  forest 

mcrgcablc  heap 

insert,  delete,  min,  union 

2-3  tree,  binomial  forest,  leftist  tree 

concatcnable  queue 

|  insert  delete,  split  concatenate 

2-3  tree 

Here  “balanced  tree”  means  any  of  several  balanced  tree  schemes,  including  2-3  trees,  AVI.  trees, 
and  RB-trees. 
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2.2.  Binomial  Queues. 


2.2.1.  Definitions. 

Binomial  queues  are  used  to  implement  priority  queues  and  mergeable  heaps.  These  queues  are 
based  on  binomial  trees.  Binomial  trees  are  defined  recursively.  The  smallest  one,  B0  is  a  single  node. 
In  general,  Bn  consists  of  two  Z?n_,’s,  such  that  the  root  of  one  is  made  the  son  of  the  root  of  the  other. 
A  picture  should  help. 


There  are  two  ways  of  thinking  of  binomial  trees,  depending  on  whether  the  sons  of  the  root  or  the 
longest  path  from  the  root  to  a  leaf  is  bcifig  concentrated  on.  This  path  is  called  the  handle.  There  is 
one  tree  of  each  size  on  the  handle.  Again  a  picture  should  help. 


A  binomial  tree  has  the  following  properties: 


|B»|  =  2" 

hexght(Bn )  =  n 


2.2.  Binomial  Queues. 
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there  are 


nodes  at  depth  k 


We  have  seen  how  to  have  a  number  of  nodes  in  a  tree  that  is  a  power  of  2.  It  is  necessary  to  allow 
structures  with  an  arbitrary  number  of  nodes,  n.  Consider  the  binary  expansion  of  n,  n  =  £  6*2*.  Then 
the  binomial  forest,  F„,  of  n  nodes,  includes  a  Bt  if  and  only  if  fc,  =  1. 

A  binomial  queue  is  a  binomial  forest  with  a  key,  K,  attached  to  every  node,  t,  so  that  the  “heap 
condition”  is  satisfied.  The  “heap  condition”  states  that  if  node  j  is  a  descendent  of  node  i  then 
Ki  <  Kj. 


2.2.2.  Union. 

Now  it  is  necessary  to  consider  how  the  operations  arc  implemented.  First  consider  the  operation  of 
union.  That  is,  there  arc  two  forests,  F,  and  F:  that  need  to  be  merged.  There  are  two  cases  to  consider: 

a)  If  t  and  j  are  powers  of  2  then  if  i  j  the  result  is  simply  the  two  trees.  Otherwise,  the  two  trees 
must  be  joined.  This  is  done  by  having  the  root  with  the  smaller  key  become  the  father  of  the  other 
root. 

b)  In  general,  a  process  like  that  of  binary  addition  is  needed.  Each  size  of  tree  from  smallest  to  largest 
is  considered.  Sometimes  there  is  a  side  tree  called  the  carry  present  Initially  it  is  absent  For 
any  size  there  are  from  zero  to  three  trees  present  possibly  a  carry  and  possibly  one  tree  from  each 
forest.  If  there  are  zero  trees  present  then  there  is  no  tree  of  that  size  in  the  result  and  no  carry  for 
the  next  stage.  If  there  is  exactly  one  tree  present  that  tree  is  present  in  the  result  and  there  is  no 
carry.  If  two  trees  are  present  then  they  are  merged  as  in  case  (a)  to  form  the  carry.  There  is  no- 
tree  of  that  size  present  in  the  answer.  If  three  trees  are  present  then  two  arc  merged  to  form  the 
carry  and  the  third  is  present  in  the  answer.  The  cost  of  this  operation  is  bounded  by  the  number 
of  trees  in  the  forests.  Therefore  the  cost  of  union(/\,  Ff)  is  0(log(i)  +  log(j)). 

2.2.3.  Insertion. 

The  operation  of  insertion  is  conceptually  the  same  as  union  with  a  one  element  forest  There  are, 
however,  subtleties  to  the  analysis  of  the  cost  that  make  it  worth  while  to  be  considered  separately.  Since 
one  of  the  forests  consists  of  one  node,  once  the  chain  of  carries  stops,  the  rest  of  the  remaining  forest 
can  be  added  in  constant  time.  Therefore,  the  cost  of  adding  n  nodes  to  a  binomial  queue  is  0(n  +  c), 
where  c  is  the  number  of  trees  merged.  Each  edge  in  the  forest  represents  the  fact  that  two  trees  have 
been  merged.  The  number  of  edges  in  Fm  is  m  —  w(m),  where  w(rn )  is  the  number  of  l’s  in  the  binary 
representation  of  m  and  the  number  of  trees  in  Fm.  Therefore  c  =  ( m  +  n  —  w(m  +  n))~(m  —  w(m ))  = 
n  +  (w(m  +  n)  —  w(m)).  Since  0  <  w(j)  <  log(j),  c  <  n  +  log(n  +  m).  Therefore,  the  cost  of  doing 
n  insertions  into  an  m  node  forest  is  0(n  4-  log(n  +  m)) 

2.2.4.  Min. 

To  find  the  smallest  clement  in  a  forest,  it  is  necessary  to  look  at  the  root  of  each  tree  to  find  the 
smallest,  '('here  are  w{n)  =0(log(n))  of  these. 

2.2.5.  Deletion. 

The  deletion  operation  is  the  most  complicated.  The  node  that  is  to  be  deleted  is  in  some  tree, 
Bi.  The  deletion  operation  consists  of  somehow  decomposing  this  tree  into  a  forest  consisting  of  the 
node  to  be  deleted  and  one  tree  of  each  smaller  size.  This  forest  is  then  merged  with  the  original  forest 
with  Bj  deleted.  When  considering  the  decomposition  of  //,,  there  are  two  cases,  either  the  node  to  be 
removed  is  the  root  or  it  is  not.  If  the  node  is  the  root,  then  die  second  way  of  looking  at  binomial  trees 
determines  the  decomposition.  If  the  node  to  be  deleted  is  not  the  root  it  is  a  member  of  some  B ,  on 
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the  handle.  The  trees  below  By  on  the  handle  form  a  By.  That  tree  and  the  trees  above  By  are  part  of 
the  decomposition.  The  trees  smaller  than  By  in  the  decomposition  are  found  by  recursively  deleting  the 
node  from  By.  Each  tree  can  be  found  in  unit  time.  Therefore,  the  time  to  delete  a  node  from  Fn  is 
0(log(n)).  Note  that  for  this  to  be  done  the  node  must  first  be  found.  A  dictionary  is  useful  for  doing 
this. 

The  details  of  how  to  represent  binomial  trees  are  covered  in  the  homework.  For  an  extended 
discussion,  also  see  [Bro78]. 

Some  applications  of  binomial  queues  include  scheduling,  discrete  simulation,  sorting,  optimal  code 
construction,  shortest  path  algorithms,  minimum  cost  spanning  tree  algorithms.  Binomial  trees  are 
important  in  a  linear  median  algorithm  and  other  places. 


Chapter  3 


Selection  —  the  Median  Problem 


Suppose  we  are  given  a  totally  ordered  universe  (U ,  <)  and  we  want  to  find,  for  some  i  with 
1  <  *  <  n,  the  i-th  largest  element  of  a  given  finite  S  C  U  with  n  elements.  For »'  =  1,  this  is  the 
problem  of  finding  the  minimum  (resp.,  maximum,  if  we  reverse  the  order)  of  a  set  It  can  (and  in  sports 
usually  is)  solved  by  an  l nock  out  tournament ,  and  it  is  not  hard  to  see  that  this  method  minimizes  the 
number  of  comparisons  (i.e.,  the  number  of  “matches").  However,  when  it  comes  to  determining  the 
vice  champion,  this  need  not  be,  by  any  means,  the  loser  of  the  final  match.  As  a  matter  of  fact  (though 
this  is  not  commonly  realized  in  the  world  of  sports),  it  might  be  any  one  of  the  participants  in  the 
tournament  who  was  thrown  out  by  losing  against  the  final  winner  (of  course,  we  assume  here  transitivity 
of  the  “is  a  better  player  than”  relation).  And  it  was  also  Jiis  apparent  injustice  which  might  have  led  to 
the  formulation  of  the  first  selection  algorithm  (Car83)  (for  »  l,n). 

Another  variant  of  the  selection  problem  which  deserves  specific  mentioning  is  the  so-called  median 
problem.  It  calls  for  the  determination  of  the  fn/2]-th  largest  clement  and  is  of  practical  importance 
whenever  we  want  to  split  a  set  into  two  equal  sized  parts  such  that  the  elements  in  the  one  part  are  all 
smaller  and  those  in  the  other  all  bigger  than  the  splitting  element  (we  may  assume  here  without  loss  of 
generality  that  »  is  odd). 

In  the  sequel,  we  are  going  to  discuss  two  algorithms  for  selection  problems.  For  the  analysis  of 
these  algorithms,  we  choose  to  take  the  number  of  comparisons  made  by  the  algorithm  between  elements 
of  the  set,  as  the  dominant  part  of  the  overall  complexity.  In  these  terms,  both  algorithms  turn  out  to  be 
of  linear  complexity. 


3.1.  The  Blum-Floyd-Pratt-Rivcst-Tarjan  Selection  Algorithm. 

Let  m  be  a  small,  odd  number  (say,  5  <  m  <  15),  and  consider  the  following  algorithm  for 
selecting  the  t'-th  smallest  element  of  S  for  any  »  between  1  and  r». 

Linear  Selection  Algorithm  I: 

1.  Divide  S  into  fn/m ]  subsets,  |n/mj  of  them  with  m  elements. 

2.  Sort  these  (n/mj  subsets;  if  (n/mj  is  even,  alSv,  sort  the  remaining  subset  minus  one  clement.  Let  S' 
be  the  set  of  the  medians  of  these  sorted  subsets. 

3.  Determine  recursively  the  f|5'j/2]-th  element  of  S',  call  it  a. 

4.  Divide  S  into  :=  {a'  €  S; »'  <  a}  and  52  :=  {»'  €  S-, »’  >  a} 

(note  that  |S||,  |S2|  >  n/4  if  we  assume,  and  we  do  this  without  loss  of  generality,  that  n  >  3m  -  1). 

5.  If  Si  >  i  determine  recursively  the  (-smallest  clement  of  $r, 

if  St  >  n-i  recursively  determine  the  t  -  |S(|  -  1 -smallest  clement  of  Sf; 
otherwise  return  a. 

The  worst-case  complexity  T  of  this  algorithm  (measured  in  number  of  comparisons)  hence  is 
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bounded  by 

T(n)  <  r(fn/m1)  +  r(l3r»/‘1J)  +  •  Cm  +  L«/2J  +  rn, 

where  Cm  is  the  number  of  comparisons  needed  to  sort  m  elements  (using,  say,  the  Ford-Johnson  sorting 
algorithm).  By  induction  we  obtain 


T(n)  <  emn  with  cm  »  20. 


For  further  details  see  [BFP73J. 


3.2.  The  Schonhage-Paterson-Pippenger  Median  Algorithm. 

This  algorithm  is  the  best  algorithm  known  so  far  with  respect  to  its  asymptotic  complexity  (expressed 
in  number  of  comparisons  used  by  the  algorithm).  It  makes  essential  use  of  binomial  trees.  It  is  also  of 
interest  because  it  is  not  recursive  (based  on  a  divide-and-conquer  approach)  but  rather  is  best  described 
as  a  pipelined  algorithm. 

Definition: 

Let  Sk  denote  the  partial  order  consisting  of  2k  +  1  elements  k  of  which  are  smaller,  and  k  of  which 
are  bigger  than  the  remaining  element,  which  is  called  the  center  of  Sk. 

We  are  now  going  to  define  a  special  variant  of  binomial  trees  with  one  distinguished  node,  called 
the  center.  The  nodes  of  the  trees  are  elements  from  a  totally  ordered  universe. 

Definition: 

(a)  The  tree  H0  is  a  singleton  node,  which  is  also  its  center. 

(b)  The  tree  Ht  is  obtained  by  combining  two  H0' s  by  an  edge;  its  center  is  the  smaller  of  the  H0’ s. 

(c)  The  tree  Ht  is  obtained  by  combining  two  H,’s  by  an  edge;  its  center  is  the  bigger  of  the  centers 
of  the  two  Hi's. 

(d)  The  tree  Hik.  for  h  >  0,  is  obtained  by  combining  two  Htk-  i’s  by  an  edge;  its  center  is  the  smaller 
of  the  centers  of  the  two  constituents. 

(e)  The  tree  Hik+l,  for  h  >  0,  is  obtained  by  combining  two  H2k’s  by  an  edge;  its  center  is  the  bigger 
of  the  centers  of  the  two  constituent  trees. 

(Note  that  there  is  an  irregularity  in  the  definition  of  the  Ht  just  for  small  t;  H0,  Hit  Hit ...,  Htk+ i,  •  . . 
are  of  the  same  “variety”,  and  so  are  Ht,Ht,H4,...,Hth,....) 

We  now  state  those  properties  of  the  Hh  which  will  be  needed  in  the  algorithm,  in  the  following 

Decomposhioii  Lemma: 

(a)  Hk  has  2k  nodes,  exactly  2*  -  1  comparisons  arc  needed  to  produce  it  (because  it  is  a  tree!). 

(b)  For  h  >  1,  HXk  can  be  decomposed  into 

-its  center, 

-a  (disjoint)  collection  {Ho,Hi,Ht,...,Hth-i}  of  subtrees  all  of  whose  centers  are  above  the 
center  of  Hth,  and 

-a  collection  {Hit  Ht,  Ht, . . . ,  Hik-t}  of  subtrees  all  of  whose  centers  are  below  the  center  of 

Htk. 

(c)  Hr  can  be  split  such  that  the  component  of  the  center  contains  exactly  elements  all  at  or 

o)  above  the  center  if  r  =  0,3, 5,7, ..., 

0)  below  the  center  if  r  =  1,2, 4,8,..., 
by  removing  at  most  f  •  2l,/,J  edges. 


3.2.  The  Schonhace-Paterson-Pippenger  Median  algorithm. 
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(d)  If  fc  <  2h  —  1  then  //2h  can  be  decomposed  such  that  the  :omponent  of  the  center  contains  2k  +  1 
elements,  k  of  which  are  bigger,  and  k  of  which  are  smaller  than  the  center  (i.e.,  the  component 
contains  Sk)\  for  this,  at  most  3Jk  +  2h  edges  have  to  be  cut,  and  the  remaining  components  are  of 
the  form  H„  a  <  2  h. 

The  proof  of  this  Decomposition  Lemma  is  left  as  a  homework  problem.  For  further  details,  also 
see  [SPP76], 

Now  suppose  that  we  have  a  chain  of  t  pairwise  disjoint  Sks  whose  nodes  are  taken  from  some  set 
with  n  elements,  and  let  r  be  the  number  of  leftover  elements  (i.e.,  n  —  (2k  +  1)1  +  r).  Also  assume 
that  the  centers  of  the  S*’s  appear  in  the  chain  in  increasing  order,  and  let  top  (bot)  be  the  last  (first) 
center  in  the  chain.  Under  the  assumption  that  r  is  less  than  t  -  1,  we  can  then  conclude  that  top  is 
known  to  be  bigger  than 

k  +  (t-i)(k  +  !)>*  +  (*  +  !)(  ^±l_i)=!Lzi. 

Hence  top  and  the  elements  known  to  be  bigger  than  top  arc  also  known  to  be  bigger  than  the  median 
of  the  n  element  set  A  corresponding  situation  applies  to  bot.  'ITius,  2(fc  +  1)  elements  can  be  discarded 
from  the  set  and  the  median  determined  as  the  median  of  the  remainder. 

The  (basic  version  of  the)  SPP  Median  Algorithm  is  shown  in  the  diagram  on  the  next  page.  There, 
n  is  the  number  of  elements  for  which  we  search  the  median,  k  is  set  to  [n1/4J,  and  h  is  chosen  such 
that  2h~l  <  k  <  2*. 

The  algorithm  maintains  two  pools,  one  of  which  contains  H,' s,  the  other  just  singletons  (the  two 
pools  actually  could  be  merged;  for  subsequent  improvements  of  the  basic  algorithm,  it  is  however 
convenient  to  have  them  separate).  Initially  all  elements  of  the  set  are  in  this  second  pool.  The  first 
process  in  the  algorithm  recursively  forms  //2*-trees,  whenever  there  is  enough  building  material  available 
in  the  two  pools.  This  means  in  particular  that,  whenever  there  are  two  H.'s  with  the  same  s  in  the  pools, 
they  are  combined  to  produce  an  H,+l.  Whenever  an  II2h  gets  produced  by  this  formation  process, 
it  is  decomposed  into  a  component  containing  Sk  and  various  //.’s  (which  arc  being  recycled  in  their 
pool)  according  to  the  Decomposition  Lemma  stated  above.  The  Sk  -component  is  then  inserted  into  a 
chain  of  such  components  in  a  way  such  that  their  centers  are  in  increasing  order.  Whenever  none  of 
the  processes  discussed  so  far  can  run  any  more,  one  of  two  other  processes  is  started,  depending  on  the 
relation  between  the  length  t  of  the  Sk  -chain  and  the  number  r  of  elements  which  arc  leftover  elements 
outside  of  the  chain.  If  t  —  1  >  r  then,  as  argued  above,  2k +2  extremal  elements  in  the  chain  can  be 
discarded  because  they  cannot  possibly  be  the  median,  and  as  half  of  them  lie  below  and  half  of  them 
lie  above  the  original  median,  we  are  still  left  with  determining  the  median  of  the  remaining  elements.  If 
4—1  <  r  the  following  analysis  will  show  that  the  number  of  elements  remaining  in  the  whole  process 
is  small  enough  so  we  can  sort  them  and  thus  determine  their  median. 

Analysis  of  the  SPP  Median  Algorithm. 

Consider  the  following  quantities: 

r  :=  the  number  of  leftover  elements  (elements  which  are  stuck  in  the  recursive  Hth  formation  process 
respectively  its  input  pools).  Note  that  at  most  one  II,  can  be  left  over  for  every  a  <  2 h.  and  as 
II,  contains  2*  elements,  we  have 

r<2tk-  1. 

R  :=?  the  number  of  elements  to  be  sorted  in  the  final  step  of  the  algorithm;  these  are  the  elements  in 
the  chain  when  this  last  step  is  invoked,  and  the  leftover  elements.  As  t  <  r  +  1  in  this  case,  we 
have 


R  =  (2Jk  +  1)1  +  r  <  2th(2k  +  1)  +  2*'*  -  1. 


3.  SELECTION  —  THE  MEDIAN  PROBLEM 


3.3.  A  lower  Bound  For  tiie  Median  Problem. 
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m  :=  the  total  number  of  5* -components  produced  during  the  whole  process.  As,  whenever  2k  +  2 
elements  are  discarded  from  the  process,  two  Sk  -components  arc  destroyed,  m  equals  twice  the 
number  of  such  discarding  actions  plus  the  number  t  of  S* -components  left  at  die  end.  Hence, 


m 


2- 


n-R 
2Jk  +  2 


+ 1  = 


n-R 
k  +  1 


+  t* 


We  are  now  able  to  count  T(n),  the  total  number  of  comparisons  made  by  the  algorithm.  As 
the  algorithm  always  preserves  acyclicity  of  all  graphs  involved.  T(n)  is  equal  to  the  number  of  edges 
generated  during  the  algorithm.  And  this  number  in  turn  is  bounded  above  by 
the  number  of  edges  in  all  S*  -components 

plus  the  total  number  of  edges  removed  in  the  decomposition  process 

plus  the  number  of  edges  left  at  the  end  in  the  pools  of  the  //4k- formation  process, 

plus  the  total  number  of  comparisons  for  the  chain  insertion  process, 

plus  the  number  of  comparisons  for  the  final  sorting  process. 

Together  this  yields 

T(n)  <  +  t)[2k  +  3*  +  2A  +  log(n/(2*  +  I))J  +  RlogR  +  r. 

*  T  l 

As  k  =  |nl/4J,  and  2k~l  <  k  <  2h,  wc  have 

r  =0(**); 
t  =0(k*); 

R  =0(**),  and  hence 


T[n)  <  5n  +  o(n). 

Some  improvements  are  possible  for  this  basic  version  of  the  algorithm.  Their  description  would  be 
very  laborious,  and  the  reader  is  referred  to  [SPP76J  for  the  details.  Implementing  these  changes,  we 
obtain  a  median  algorithm  whose  asymptotic  complexity  in  terms  of  number  of  comparisons  is  3n  +  o(n), 
the  best  known  upper  bound  so  far. 


Theorem: 

The  median  of  a  set  of  n  elements  can  be  determined  using  at  most  3n  +  o(n)  comparisons. 


33.  A  Lower  Bound  For  the  Median  Problem. 

We  have  seen  that  it  is  possible  to  compute  the  median  of  an  n-elcmcnt  set  in  linear  time.  The 
question  of  how  many  comparisions  are  needed  to  do  this  naturally  arises.  Tnc  best  published  result  of 
7 /An  —  o(n)  is  due  to  Pratt  and  F.  Yao  (PFY73J.  Here  a  weaker  result  due  to  Blum,  Floyd,  Rivcst,  Pratt, 
and  Tatjan  [BFP73]  will  be  discussed. 

Theorem: 

Finding  the  median  of  an  n  element  set  requires  at  least  f3n/2]  -  2  comparisons. 

Proof:  The  proof  of  this  theorem  uses  what  is  called  an  adversary  argument  An  algorithm  A  asks 
questions  of  the  form:  “is  x  <  yV  The  adversary,  B,  gives  answers  so  that  all  of  its  answers  are 
consistent  and  A  has  to  ask  “many”  questions. 

It  B  necessary  to  describe  B  in  greater  detail.  Suppose  n  is  odd  and  m  :=  fn/2").  B  maintains 
three  sets  of  elements:  U,  L,  G.  B  decides  that  all  of  the  elements  in  G  arc  greater  than  die  median  and 
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3.  Selection  —  the  Median  problem 


that  all  of  the  elements  of  L  are  less  than  the  median.  The  elements  of  U  are  candidates  for  being  the 
median.  B  also  ensures  that  if  g  €  G,  l  €  L,  and  u  6  U  then  g  >  u  >  /. 

Basic  observation: 

As  long  as  \L\,  |G|  <  fn/2]  it  is  consistent  for  B  to  assume  that  the  median  is  in  U.  If  in  addition, 
there  are  two  incomparable  elements  in  U,  A  has  cannot  have  determined  the  median  yet 

Thus,  B  will  try  to  keep  L  and  G  small  and  U  big.  For  simplicity,  however,  B  will  keep  in  U  only 
those  elements  which  have  been  compared  to  at  most  one  other  element  currently  in  U.  If  this  condition 
is  violated  due  to  a  question  from  A,  appropriate  elements  are  removed  from  U  and  put  into  L  or  G  so 
as  to  restore  this  condition.  B  always  answers  such  questions  so  that  there  are  never  three  elements  of  U, 
a,  b,  c  such  that  a  is  known  to  be  less  than  b  and  b  is  known  to  be  less  than  c. 

First  phase  of  the  adversary  algorithm. 

Initially,  L  =  G  —  0,  and  U  contains  all  n  elements.  If  A  asks  a  question  “Is  x  <  y?”,  and  x  and  y 
are  not  both  in  U  B  answers  it  so  that  it  is  consistent  and  the  invariance  L“  U,  G 

is  maintained.  From  B’s  point  of  view  this  is  a  redundant  comparison.  If  x  and  y  are  both  in  U  then  B 
answers  the  question  depending  on  which  of  x  and  y,  if  any,  have  been  compared  and  what  the  result 
was.  The  six  cases  are  drawn  below.  A  line  between  two  elements  indicates  that  the  higher  one  is  known 
by  A  to  be  greater  than  the  lower  one.  A  dotted  line  represents  the  question  A  is  asking. 


state  of  x,y  B’s  action  number  of  pairs  |(/|  |L|  |G| 


3.4.  References. 


19 


If  C  is  the  number  of  comparisons  made  so  far,  and  P  is  the  number  of  pairs  in  U,  then  it  is  obvious 
by  induction  that : 


C  —  P  +  2\U\  >  2 n. 


Second  phase  of  the  adversary  algorithm. 

The  first  phase  of  the  adversary  algorithm  stops  if  either  \L\  =  [n/2j,  \G\  =  |n/2j,  or  there  are  not 
two  incomparable  elements  in  U .  Suppose  the  latter  happens.  Then  |{/|  <  2,  so  C  —  P  +  2(2)  >  2 n. 
This  means  that  C  >  2n  —  3. 

If,  on  the  other  hand,  L  or  G  gets  too  large,  B  enters  a  second  phase.  Suppose  without  loss  of 
generality,  that  it  was  L  that  became  to  big.  B  will  then  force  A  to  find  the  minimum  element  of  U , 
which  will  be  the  median.  This  phase  will  require  an  additional  \U\  —  P—  1  comparisons,  because  \U\  —  1 
comparisons  are  ncessary  to  find  the  minimum  of  \U\  elements  and  at  most  P  of  the  comparisons  have 
been  done.  Rearranging  the  invariant. 


C  >  2n  +  P  -  2\U\ 

.  This  means  that  the  total  number  of  comparisons  for  both  phases  satisfies 

C  +  \U\  -  P  -  1  >  2n  +  P  -  2\U\  +  \U\  -  P  -  I  =  2n  -  \U\ -  l. 
Sihce  L  —  |n/2j,  U  <  [n/2].  Therefore, 

C  +  \U\-P-\  >  fyl-2, 


which  completes  the  proof.  | 


3.4.  References. 

Expected  time  bounds  are  discussed  in  [F1R75J.  Reiser  [Rei78]  discusses  the  selection  problem  when 
the  elements  have  weights.  Further  lower  bounds  for  the  selection  problem  are  proven  in  [Hya76], 


Chapter  4 


Minimum  Spanning  Trees 


4.1.  A  Schema  for  Minimum  Spanning  Tree  Algorithms. 

Let  G  =  [V ,  E)  be  a  connected  undirected  graph  with  |V|  =  n  nodes  and  |E|  =  e  edges,  and  let 
c(v,w)  >  0  be  a  cost  value  attached  to  (v,  w)  g  E. 

Definition: 

A  minimum  spanning  tree  for  ( G ,  c)  is  any  tree  T  =  (V,  E')  with  the  same  node  set  as  G,  and  with 
edge  set  E'  C  E  such  that  c(vi®)  is  minimum. 

First  basic  observation : 

Let  V'  C  V,  and  let  (v,w)  be  an  edge  in  E,  such  that  v  6  V'  and  w  6  V  -  V'  and  such  that 
c(v,  to)  is  minimum  with  respect  to  all  edges  having  the  first  property.  Then  some  minimum  spanning 
tree  for  (G,  c)  contains  (v,  w). 

Second  basic  observation: 

Let  ( v ,  to)  be  an  edge  on  some  cycle  C  in  G  such  that  c(v,  w)  is  maximum  for  all  the  edges  on  C. 
Then  some  minimum  spanning  tree  for  (G,c)  docs  not  contain  (v,  w). 

The  proof  of  these  two  properties  is  left  as  an  easy  exercise.  Though  both  of  them  can  be  used  to 
construct  efficient  algorithms  for  finding  minimum  spanning  trees,  we  shall,  in  the  sequel,  only  employ 
the  first  From  it  we  can  immediately  derive  the  following  very  general  frame  for  minimum  spanning 
tree  algorithms: 


1.  Initialize  a  forest  F  of  n  trees,  each  of  them  a  singleton  node  from  V. 

Repeat  as  long  as  F  has  more  than  one  tree: 

2.  select  a  tree  2’  from  F\ 

3.  find  an  edge  (v,  w)  with  minimal  c(v,  to)  such  that  v  is  in  T  and  w  is 

not;  let  V  be  w’s  tree; 

4.  update  F  by  combining  T,  T\  and  (v,  ta)  into  a  single  tree. 

There  arc  a  number  of  possibilities  to  implement  the  forest  F,  to  choose  criteria  for  the  selection  of 
a  tree  T  in  step  2,  and  to  facilitate  the  search  operations  in  step  3.  We  arc  going  to  discuss  two  solutions. 
The  first  one  is  very  simple  and  easy  to  program,  whereas  the  second  has  an  asymptotic  complexity  which 
is  the  best  known  so  far  for  the  general  case  (better  upper  bounds  arc  known  if  we  restrict  the  class  of 
graphs  being  considered,  e.g.,  to  planar  graphs). 


4.3.  A  Better  Minimum  Spanning  Tree  Ai.gorithm. 
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4.2.  Kruskal’s  Algorithm. 

In  Kruskal’s  algorithm  (Kru56J,  the  edges  are  first  sorted  according  to  increasing  cost,  and  then 
searched  in  this  order.  Whenever  an  edge  is  encountered  which  connects  two  different  trees,  these  two 
trees  are  combined  into  one.  For  this  last  operation,  a  simple  weighted  union  algorithm  is  used.  We 
assume  now  for  convenience  that  the  nodes  of  the  graph  G  arc  just  the  integers  between  1  and  n. 

algorithm  MST1; 
begin 

generate  a  list  L  containing  the  edges  of  G  sorted  according  to  nondecreasing  cost  value; 

initialize  an  in-forest  F,  with  the  t-th  tree  7’  consisting  of  node  t,  for  1  <  t  <  n 

co  an  in-tree  is  a  rooted  tree  with  all  edges  pointing  towards  the  root  oc; 

initialize  M  ST  to  the  empty  tree; 

for  i  :=  1  to  lertglh(L)  do 

begin 

(v,w):  =  t-th  clement  of  L\ 

x  :=  j  where  j  such  that  v  is  a  node  in  the  tree  Ti  of  F\ 
y  :=  j  where  j  such  that  w  is  a  node  in  the  tree  T,  of  7'’; 

if  x  y  then 
begin 

M ST  :=  Af5TU{(t',w)}; 

•f  |2’i|  <  |7„|  then  UN10N(7’l,,7'l)  co  making  the  root  of  Tz  a  new  son  of  the  root  of  Ty  oc 
else  UNION(7;,7;) 

end 

end 

end  MST1. 

Time  complexity  analysis  for  MSTI: 

1.  The  initial  sorting  of  the  edges  takes  time  0(eloge)  =  B(elogn). 

2.  1'hc  UNION  operations  can  obviously  be  done  each  in  time  0(1). 

In  order  to  analyse  the  complexity  of  the  FIND  operations  which  determine  the  tree  some  node  is 
currently  in,  we  notice  that  at  all  time  steps  and  for  all  trees  T  in  the  forest  F,  hcight[T)  <  log|T|. 
This  is  certainly  true  right  after  the  initialization  of  F.  When  two  trees  T  and  V  are  combined  (call  the 
result  for  the  moment  U  and  assume  without  loss  of  generality  that  \T\  <  |7V|),  we  have 
height(T")  <m*x{height(T'),  height(T)  +  1}  <  max{log|7v|,  1  +  log|T|} 

<log(|r|  +  |T'|)  =  log|T"|. 

3.  It  follows  hence  that  the  FIND  operations  can  be  done  in  time  O(logn)  each. 

Together,  we  obtain  0(elogn)  as  an  upper  and  also  as  a  worst-case  lower  bound  for  the  running 
time  of  MSTI. 


4.3.  A  Better  Minimum  Spanning  Tree  Algorithm. 

We  arc  able  to  improve  on  the  complexity  of  the  previous  algorithm  by  applying  some  care  in  the 
selection  of  the  tree  in  step  2  of  the  general  algorithm,  flic  selection  rule  which  we  arc  going  to  discuss 
is  called  unifortn  selection  (used  by  Sollin,  sec  ( I1GH56)),  and  is  specified  as  follows: 

Initially,  all  of  the  n  trees  (each  consisting  of  a  singleton  node)  arc  placed  on  a  queue. 

Also,  the  stage  of  each  of  these  trees  is  defined  to  be  0. 

In  step  2  of  the  general  algorithm,  the  tree  T  at  the  front  end  of  the  queue  is  selected  (it  will  always  have 
a  minimal  stage  among  all  the  trees  in  the  queue).  When  it  is  combined  with  some  tree  T'  in  the 
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queue,  both  T  and  T'  are  removed  from  the  queue,  and  the  combined  tree  is  inserted  at  the  end  of  the 
queue.  Its  stage  is  defined  to  be  one  more  than  the  stage  of  T  (which  is  the  minimum  of  the  stages  of 
T  and  T'). 

Basic  observations  about  uniform  selection: 

a)  The  stage  of  any  tree  in  the  queue  is  <  log  n. 

b)  At  any  time  step  of  the  algorithm,  all  trees  in  the  queue  with  minimal  stage  are  pairwise  disjoint. 
The  first  fact  comes  from  the  observation  that,  in  order  to  create  an  additional  tree  of  stage  i  +  1 ,  it 

takes  two  trees  of  stage  i.  For  the  second  property,  one  can  easily  prove  by  induction  that  at  every  time 
step  the  trees  in  the  queue  form  a  partition  of  the  node  set  of  the  graph  C. 

(An  alternative  tree  selection  rule  is  called  smallest  tree  selection.  Under  it.  the  trees  arc  also 
organized  in  a  queue,  but  at  every  step  a  tree  with  a  minimal  number  of  nodes  is  chosen.  In  an  efficient 
implementation,  one  has  to  take  care  that  such  a  minimal  size  tree  can  be  found  quickly,  and  that  the 
combined  tree  can  easily  be  reinserted.  It  would  also  be  possible  to  use  this  selection  rule  in  the  following 
algorithm,  but  a  somewhat  bigger  programming  effort  would  result.] 

For  step  3  of  the  general  algorithm,  we  implement  die  set  of  edges  incident  to  some  tree  T  (which 
in  turn  is  represented  by  its  set  of  nodes)  as  a  collection  of  ordered  subsets  of  size  <  k,  where  k  will 
be  specified  later.  Specifically,  if  there  are  m  such  edges,  we  divide  them  into  [m/fc]  subsets,  [m/fcj  of 
them  of  size  fc,  and  sort  all  these  subsets  in  order  of  nondecreasing  cost.  We  also  attach  to  the  head  of 
each  subset  a  pointer  to  the  tree  to  which  it  belongs,  and  a  list  pointer  pointing  to  the  smallest  cost  edge 
which  has  not  yet  been  searched.  Furthermore  we  assume  that  from  every  edge  (v,  v>)  there  arc  pointers 
to  the  head  of  its  subset  and  to  its  twin  representation  ( w ,  v)  (occurring  in  a  list  attached  to  w's  tree). 

algorithm  MST2; 
begin 

procedure  combinc_upto_stage(a); 
begin 

while  stage  of  the  first  tree  in  the  tree  queue  is  <  a  and  the  tree  queue  has  length  >  2  do 
begin 

let  T  be  the  first  tree  in  the  queue; 

search  each  of  T’s  edge  lists  in  the  direction  of  increasing  cost,  starting  in  each  list  at  the  smallest 
edge  not  yet  searched,  nd  deleting  all  edges  until  one  is  found  that  connects  T  to  a  different  tree, 
say  T; 

add  this  edge  to  MST\ 

combine  T"s  and  V's,  edge  lists  by  updating  the  tree  pointers  in  T’s  lists  so  that  they  point  to  V\ 

remove  T  and  V  form  the  queue; 

insert  the  combined  tree  at  the  end  of  the  queue,  and 

set  its  stage  to  the  stage  of  T  plus  one 

end 

end  combine; 

MST  :=  0; 

initialize  the  tree  queue;  each  tree  consists  of  a  singleton  node  and  is  assigned  stage  0; 
for  every  such  tree,  represent  the  set  of  incident  edges  by  a  collection  of  lists  of  size  1 ; 

*  combinc_upto_stagc(log  log  log  n); 

*  for  every  tree  in  the  tree  queue,  reinitialize  the  collection  of  edge  lists  into  ordered  lists  of  size  log  log  n; 
combinc_upto_stage(log  log  n); 

for.  every  tree  in  the  tree  queue,  reinitialize  the  collection  of  edge  lists  into  ordered  lists  of  size  logn; 
combinc_upto_stagc(log  n) 

co  by  the  first  basic  observation,  this  implies  that  only  one  tree  is  left  in  the  tree  queue  oc 
end  MST2. 


4.3.  A  Bettkk  Minimum  Spanning  Tube  Algorithm. 
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Time  complexity  analysis  for  MST2: 

Define  stage  s  of  the  algorithm  to  be  the  time  span  during  which  the  trees  taken  from  the  front  end 
of  the  tree  queue  hive  stage  s,  and  note  that 

1.  if  the  edge  lists  are  (re)initializcd  right  before  stage  s,  then  during  stage  s  there  arc  at  most  n/2* 
trees  in  the  tree  queue,  and  hence  at  most  2 e./k  +  n/2*  edge  lists  for  all  trees  together; 

2.  the  time  for  executing  stage  s'  >  s  (without  intermediate  reinitialization)  is 

where  Ds  is  the  number  of  edges  deleted  during  stage  s',  because  of  the  second  basic  property 
stated  above,  the  trees  which  are  in  the  tree  queue  at  the  beginning  of  stage  s'  arc  all  pairwise 
disjoint,  and  all  edge  lists  attached  to  them  arc  searched  at  most  once  during  stage  s',  because  of 
the  uniform  selection  rule.  Also,  due  to  die  chosen  implementation,  die  trce(s)  to  which  an  edge  is 
incident  can  be  found  in  time  0(1),  and  u  time  to  combine  the  tree  T  (selected  from  the  front 
end  of  die  queue)  with  some  other  tree  T  proportional  to  the  number  of  edge  lists  attached  to 
T\ 

3.  the  time  for  (rc)initializ.ing  the  edge  lists  to  ordered  lists  of  size  k  is  bounded  by  0(e  log  k).  Hence, 

4.  the  total  time  to  (re)initializ.c  and  then  execute  from  stage  s,  to  stage  st  is  bounded  by 

O^e  log  k  4-  Q  +  -  »i  +  1)  +  ej 

where  the  first  term  accounts  for  the  initialization,  die  second  for  the  UNION  operations,  and  the 
second  and  third  together  for  the  FIND,  MIN  and  edge  deletion  operations. 

5.  Of  die  dircc  phases  of  algorithm  MST2,  the  third  phase  (between  stages  log  log  n  and  logn  produces 
the  dominating  contribution.  Substitution  of  its  parameters  in  the  above  estimate  yields 

0(e  log  logn  +  (e/  logn  +  «/2lo*lflKn)(logn  —  log  logn)  +  e). 

Theorem: 

Algondim  MST2  finds  a  minimum  spanning  tree  for  an  arbitrary  connected  undirected  graph  with 
weighted  edges  within  dmc  bounded  by  0(e  log  log  n). 

The  steps  marked  (*)  in  Algorithm  MST2  could  actually  be  omitted  without  affecting  the  growth  rate  of 
its  time  complexity  (producing,  however,  a  bigger  constant  factor  in  the  above  estimate).  Algorithm  MST2 
was  originally  discussed  in  [Ch'1’76].  This  reference  also  presents  a  number  of  alternative  implementations 
and  more  efficient  algorithms  for  special  subcases  of  the  minimum  spanning  tree  problem.  The  first 
solution  of  time  complexity  0{e  log  logn)  for  the  general  problem  was  given  in  [AYa75]. 
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Chapter  5 


Path  Compression 


5.1.  The  UNION-FIND  Problem. 

For  most  of  the  groups  of  operations  on  sets  which  we  have  seen  so  far,  time  O(logn)  per  operation 
is  required  in  the  worst  case.  If  we  consider  only  the  two  operations  UNION  and  FIND  we  can  do  better 
using  a  different  tree  structure  (namely,  in-trees). 

The  UNION-FIND  algorithm  has  numerous  applications,  e.g.  for  the  following  problems:  determina¬ 
tion  and  verification  of  minimum  spanning  trees,  depth  determination,  closure  of  equivalence  relations, 
dominators  in  flow  graphs,  and  many  others  [Tar79). 

In  general,  we  are  given  a  sequence  of  n  intermixed  UNION  and  FIND  operations  as  defined  in 
Section  2.  We  may  assume  here  without  loss  of  generality  that  the  universal  set  is  {1, . . . ,  n},  and  the 
set  names  are  also  elements  of  {l,...,n}.  Also  note  that  the  argument  sets  of  the  UNION  operations 
are  always  disjoint. 

The  basic  idea  is  to  construct  an  in- tree  (i.e.,  the  edges  point  towards  the  root  of  the  tree)  for  each  set 
made  up  of  nodes  for  each  element  in  the  set.  Rach  node  is  a  pair  consisting  of  an  integer  representing 
the  element,  and  a  pointer  to  another  pair  (its  father  in  the  tree).  For  the  root  of  the  tree,  this  pointer  is 
assumed  to  be  nil.  Without  loss  of  generality,  the  root  of  the  tree  is  identified  with  the  name  of  the  set 
The  set  operations  are  implemented  as  follows: 

-  FlND(t'):  from  t’s  node,  follow  the  chain  of  pointers  until  the  next  pointer  is  a  nil-pointer;  its  node  is 
the  name  of »’ s  set 

-  UNION(namei,name2):  if  the  two  nodes  are  different,  make  the  second  node  a  new  son  of  the  first 
node  (i.c.,  the  nil-pointer  of  name2  is  updated  to  point  to  namei). 

For  this  implementation  the  following  sequence  obviously  is  a  bad  case: 

UNION(2,  l);UNION(3, 2);. . .  ;UNION(n,  n  —  1);  n-times  FIND(1); 

From  it  we  can  immediately  derive  the  following 

Theorem: 

The  above  implementation  of  UNION-FIND  . has  time  complexity  0(n2). 


5.2.  The  Weighting  Heuristic. 

There  are  two  heuristics  which  can  be  used  to  improve  the  basic  algorithm.  The  first  is  the  weighting 
heuristic  and  is  motivated  by  the  desire  to  keep  the  trees  balanced.  We  add  to  each  root  a  counter  which 
gives  the  number  of  nodes  in  die  tree.  The  weighting  heuristic  states  that  when  two  trees  arc  joined  the 
root  of  the  one  with  the  smaller  node  count  is  made  a  new  son  of  the  root  of  the  other.  (In  ease  of  a 
draw,  we  still  make  the  root  of  the  second  tree  a  new  son  of  the  root  of  the  first  tree.)  As  we  have  proved 


5.4.  Upper  Bounds  for  UNION-FIND  with  Path  Compression. 
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already  in  the  analysis  of  the  minimum  spanning  tree  algorithm  MST1  (Kruskal’s  algorithm.  Section  4.2) 
the  height  of  a  tree  with  n  nodes  constructed  according  to  this  weighting  heuristic  is  bounded  by  log  n. 
Hence  the  FIND  operations  require  time  at  most  O(logn)  each  while  each  UNION  can  still  be  done  in 
constant  time.  On  the  other  hand,  let  n,  =  2*  be  a  power  of  2.  Then  the  sequence 


UNION(2,l);UNION(4,3);  ...  ;UNION(n,n  -  1); 

UNION(4, 2);UNION(8, 6);  ...  ;UNlON(n,  n  -  2); 

UN10N(«,  ?); 
n-times  FIND(l); 

clearly  requires  time  0(n  log  n).  Hence  we  obtain  the 
Theorem: 

The  UNION-FIND  algorithm  with  weighting  heuristic  has  time  complexity  0(nlogn). 


5.3.  The  Path  Compression  Heuristic. 

The  second  heuristic  to  improve  the  basic  UNION-FIND  algorithm  is  called  path  compression  or 
collapsing.  It  is  motivated  by  the  observation  that  in  the  bad  case  example  for  the  basic  algorithm  n 
FIND(l)  operations  arc  performed  each  requiring  time  0(n).  After  the  first  such  FIND  operation,  the 
structure  should  be  modified  in  such  a  way  that  subsequent  FIND  operations  for  the  same  clement  are 
sped  up.  This  can  be  done  by  changing  all  pointers  encountered  in  a  FIND  operation  on  the  way  up 
to  the  root,  to  point  to  that  root.  We  can  use  a  stack  to  store  all  intermediate  nodes  on  the  FIND  path, 
or  we  can  even  use  these  pointer  cells  themselves  (and  some  constant  amount  of  additional  memory)  to 
keep  track  of  the  FIND  path  (details  to  be  worked  out  as  a  homework  assignment). 

It  is  also  left  as  an  exercise  to  construct  a  special  sequence  of  UNION’S  and  FIND’s  (here  the 
UNION’S  do  not  employ  the  weighting  heuristic,  rather  the  root  of  the  second  tree  is  always  made  a  son 
of  the  root  of  the  first  tree!)  which  requires  time  0(n  log  n).  Such  a  sequence  is  not  hard  to  find  if  one 
has  in  mind  the  two  basic  ways  to  parse  binomial  trees. 


5.4.  Upper  Bounds  for  UNION-FIND  with  Path  Compression. 

The  derivation  of  good  upper  bounds  for  UNION-FIND  algorithms  which  use  the  path  compression 
heuristic  requires  some  more  effort.  Assume  that  starting  from  singleton  sets  n  —  1  UNION’S  arc 
performed,  and  m  >  n  (intermixed)  FINDS’s.  Let  T  be  the  tree  which  would  have  been  constructed  by 
the  UNION  operations  if  there  had  been  no  intermixed  F'IND's.  For  every  element  v  in  the  universal  set 
(which  without  loss  of  generality  we  take  to  be  the  set  and  whose  elements  we  also  identify 

with  the  nodes  in  the  forest  maintained  by  the  UNION-KIND  algorithm),  let  h(v)  be  its  height  in  T. 
(Note:  this  is  a  static  quantity,  solely  depending  on  the  order  of  the  UNION'S.) 

Basic  observations: 

a)  In  any  FIND  path  (in  the  original  sequence  with  UNION’S  and  FIND’s  intermixed)  vt  -*  vt  -* 
...  -»  t>„  we  have 

h(v, )  <  h(vt)  <  ...  <  fe(vr). 

b)  Uet  v  -*  w  be  an  edge  in  a  FIND  path,  w  not  equal  to  the  root  (i.c.,  w  not  the  last  node  on 
the  FIND  path),  and  let  v  -*  w'  be  the  edge  from  v  after  the  path  compression  due  to  the  FIND 
operation.  Then  h(w')  >  h(w). 
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5.  Path  Compression 


The  first  of  these  facts  can  be  seen  as  follows.  Assume  we  have  a  UNION(7\,  T%)  in  the  original 
sequence.  Let  r,  (resp.,  r2)  be  the  root  of  T,  (resp.  Ta)  at  this  moment.  Then  we  have  in  the  rcarrangend 
sequence,  and  we  may  assume  this  by  induction,  a  UN10N(T',,  T'2)  where  T\  (resp.,  T'a)  contains  the 
same  elements  as  T,  (resp.,  Ta),  and  is  in  fact  a  rearrangement  of  Tx  (resp.,  72)  with  the  same  root  r, 
(resp.,  r2).  In  the  rearranged  sequence,  rt  becomes  a  father  of  r2,  and  hence  h(rt)  >  h(rt).  Thus  the 
same  holds  for  the  edge  r2  -»  r,  in  a  FIND  path.  Induction  on  the  number  of  edges  now  completes  the 
argument. 

The  second  fact  is  an  immediate  consequence  of  the  definition  of  path  compression  and  the  first  fact 

5.4.1.  Path  Compression  without  Weighting  Heuristic. 

We  divide  the  elements  into  disjoint  groups  and  account  for  the  cost  of  a  FIND  path  by  charging 
part  of  it  to  the  FIND  operation  and  part  of  it  to  the  nodes  on  the  path.  Note  that  the  cost  of  a  FIND 
is  roughly  equal  to  the  length  of  the  FIND  path. 

Let  the  group  C,  of  elements  consist  of  those  nodes  having  between  (bounds  included)  2<_l  and 
2*  —  1  dcscendents  in  T,  for  *  =  1, . . . ,  flog  n"|. 

For  every  FIND  path  «i  — ►  vt  v,  we  charge 

a)  the  last  edge  to  the  FIND  operation; 

b)  an  edge  vk  v*+i  with  vk  and  uk+,  in  the  same  group  to  vk+i\ 

c)  any  other  edge  to  the  FIND  operation. 

As  there  arc  only  O(logn)  groups,  every  FIND  operation  is  charged  at  most  O(logn).  Now  assume 
that  vk  -*  vk+i  and  vk.  -*  vt+,  are  (different)  FIND  path  edges  both  charged  to  vk+l  (in  different 
FIND's).  Then  vk,  vk.,  and  vk+l  are  all  in  the  same  group  C,.  It  follows,  however,  from  the  definition  of 
the  path  compression  heuristic  that  (in  T)  the  subtrees  rooted  at  vk  and  vk,  must  be  disjoint,  and  hence 
vk+i  would  have  at  least  2  •  2i_l  descendents,  contradicting  the  definition  of  group  Cx.  Hence  every 
node  is  charged  only  0(1).  Together  with  the  matching  lower  bound  (which  is  given  as  a  homework 
problem)  we  have 

Theorem: 

The  UNION-FIND  algorithm  with  path  compression  but  without  weighting  heuristic  requires 
0(n  log  n)  time. 

5.4.2.  Path  Compression  and  Weighted  UNION. 

We  use  a  simila'1  approach  as  before,  with  a  different  division  into  groups. 

Consider  the  functions 

(F(0)  :=  0) 

if  n  <  1; 
otherwise. 

(Note:  log  *F(n)  =  n.) 

Let  now  group  C<  consist  of  those  nodes  x  having  i  =  log  *(h(x)),  i.e., 

F(t  -  1)  <  h(x)  <  F(»);  for  t  =  1, ....  log  *n, 
and  C0  of  the  nodes  with  height  zero. 

The  cost  for  every  FIND  path  is  charged  in  cases  a)  and  c)  as  above  whereas  b)  is  replaced  by 
tf)  an  edge  vk  -»  ok+i  with  vk  and  vk+l  in  the  same  group  is  charged  to  vk.  Hence  every  FIND 
operation  is  attributed  a  cost  of  at  most  0(log  *n).  The  charge  to  every  vk  in  Cj,  i  >  0,  is  at  most 


F(n)  :=  2*  (with  n  2’s) 

0 

i + log  *(n°s  ni) 


5.4.  Uppeh  Bounds  for  UNION-FIND  with  Path  Compression. 
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A'(t')  -  F(i  -  1)  because  of  the  second  basic  observation  made  above.  As  there  are  at  most  n/2* 
vertices  in  T  with  h(x)  =  h  (as  we  have  shown  for  the  weighted  UNION  operation  in  4.2)  we  have 

| <7.1  <  V  -  <  2 _ - _ -  _ 

~  ,=p^,)+1  2J  ~  2',C-')+*  F{i)' 

Hence  all  vertices  in  C,  are  charged  together  at  most  O(n).  Summing  up  all  the  pieces  we  obtain 
the 

Theorem: 

The  UNION-FIND  algorithm  with  path  compression  and  weighted  UNION  requires  0(n  log  *n) 
time. 

By  a  much  more  sophisticated  analysis,  which  uses  a  multilevel  subdivision  into  groups  (we  only  used 
one  level),  one  can  improve  the  upper  bound  for  this  variant  of  the  UNION-FIND  algorithm  to 

0(ma(m,n)) 

where  a(m,  n)  is  closely  related  to  die  "inverse"  of  Ackermann’s  function  A(m,  n.)  with 

A(m,0)  =  1; 

A(0,n)  =  2n  +  1; 

A(m  +  1,  n  +  1)  ==  A(m,  A(m  +  1, »));  for  m,  n  >  0. 

Ackermann’s  function  grows  faster  than  any  primitive  recursive  function.  As  a  consequence,  a(m,  n)  <  6 
for  all  "possible”  values  of  m  and  n.  For  more  details,  we  refer  the  reader  to  p’ar75J.  In  (Tar77],  a 
matching  lower  bound  for  the  UNION-FIND  problem  is  proven  for  a  very  general  class  of  machines. 
Finally,  p'ar79]  gives  some  extensions  of  path  compression  to  problems  which  require  the  computation  of 
certain  functions  along  FIND  paths.  However,  for  the  lowest  common  ancestor  problem  discussed  there 
also  compare  [Har80], 


Chapter  6 


Pattern  Matching  in  Strings 


6.1.  Pattern  Matching  Problems. 

We  are  interested  in  finding  interesting  patterns  in  strings  of  characters.  There  arc  many  ways  that 
this  problem  can  be  made  more  rigorous,  but  first  some  definitions.  Let  E  be  some  finite  alphabet.  There 
are  two  strings  of  interest,  the  text,  text,  and  the  pattern,  pat.  The  following  notations  are  used: 

text  —  tj. ..t„,  tj  E  E 

textiti  =  <<...  t, 
pat  =  pi. .  p<  G  £ 

!»«'"  =  ff-piX,  vfez 

We  are  interested  in  the  following  problems: 

1.  Find  the  first  (all)  occurrcncc(s)  of  pat  as  a  substring  in  text. 

2.  Find  the  first  (all)  occurrcnce(s)  of  any  of  the  pat as  a  substring  in  text. 

3.  Find  the  longest  common  substring^)  of  text  and  pat. 

4.  For  some  (each)  position,  t,  in  test,  find  the  position(s)  fc  ^  i,  such  that  the  common  prefix  of 
textjjB  and  text*,n  has  maximal  length.  This  problem  is  called  “internal  matching". 

5.  For  some  (each)  position,  i,  in  text,  find  the  position(s)  k  in  pat  such  that  the  common  prefix  of 
testi,n  and  patk.m  has  maximal  length.  This  problem  is  called  “external  matching”. 

6.  Problem  2,  except  that  the  paf(<)  are  given  by  a  regular  expression. 


6.2.  Sets  of  Patterns  Given  by  Regular  Expressions. 

We  will  consider  the  last  problem  first  Let  a  be  a  regular  expression  over  E,  and 

M.  —  [S,  s0,  f, 6) 

be  a  non-deterministic  finite  state  automaton  recognizing  La  C  E*.  Suppose  that  Ma  has  the  following 
properties: 

a)  M*  may  contain  e-transitions; 

b)  |*(«,a}|  <  2,  Vo€£u{e}; 

c)  |S|<2|a|. 

Here  the  length  of  the  regular  expression,  |a|,  is  computed  by  counting  one  for  a  symbol  from  E 
that  appears  and  one  for  each  connective  including  concatenation.  It  is  easy  to  construct  an  Ma  from  a 
with  these  properties  by  structural  induction. 


6.3.  Automata  Recocnizinc  Substrings. 
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Wc  now  compute  the  sequence  St  of  sets  of  states  that  Ma  could  be  in  after  having  read  <eztlit. 
algorithm  rcg_pat(a,<ex<); 

begin 

construct  Ma  —  (S,a0,f,6)\ 

for  i:=0  to  n  do 

begin 

if  i=0  then  50  :=  {s0}  else  5,  :=  ^(»,  d»): 

mark  all  s  €  5,  ; 
unmark  all  a  £  5  -  5,-; 

Q:= queue  of  elements  in  5,  ; 
while  Q  is  not  empty  do 
begin 

take  an  clement,  say  a,  from  Q; 
add  each  unmarked  s'  £  6(8,  c)  to  S,  and  Q  and  mark  it 
end 

co  wc  have  just  closed  S,  under  e-transitions  oc; 
if  f  £  Si  and  i  >  0  then  mark  t,  with  _ 

co  the  marked  elements  arc  the  the  places  where  an  instance  of  an  clement  of  La  ends  oc 

end 

end. 

It  is  obvious  that  exactly  every  prefix,  text,ti  of  text  ending  with  an  underscored  f,  is  in  La. 

The  construction  of  Ma  can  be  done  in  time  0(|a|).  Clearly  the  running  time  of  the  algorithm 
reg_pat  is  bounded  by 

O(n)Sj)  =  0(n\a\), 

because  in  the  inner  loop,  every  a  £  S  is  put  on  the  queue  at  most  once  and  |£(s,  a)|  <  2. 

Extensions: 

If  we  are  looking  for  a  pattern  La  in  the  middle  of  text ,  first  replace  a  by  E*a. 

How  to  find  the  left  cnd(s)  of  pattcrn(s),  ha,  is  left  as  a  homework  problem. 

Better  upper  bounds  can  be  obtained  for  some  special  forms  of  «.  For  more  information  see  [KMP77] 
and  [FiP74], 


6.3.  Automata  Recognizing  Substrings. 


Consider  for  now  problem  1,  that  of  finding  the  first  (all)  occurrence^)  of  pat  as  a  substring  in  text. 
This  is  an  instance  of  the  previous  problem  where  the  regular  expression  a  is  an  clement  of  E*.  Consider 
the  “skeletal  machine”,  M'pat: 


Pm 


I 


6.  Pattern  Matching  in  Strings 


General  idea : 

M'pit  reads  ft,  ft, . . .  in  state  s0  until  the  first  t,  =  p,  is  encountered.  Then,  M'fat  changes  state  to 
•i.  Suppose  that  after  having  read  textlik,  M'pmt  is  in  state  s}.  This  implies  that 

textk-i+i,k  =  pat lji  and 

^  patij,,  if  j'  >  j. 

Now  Mppt  reads  f*+1.  ff  ft+,  =  py+,  then  M'pat  changes  state  to  ay+,  and  iterates.  If  ft+i  py+l. 

then  there  can  be  no  complete  match  at  position  k  —  j  +  !.  That  is 

fexf*_y+ i,*_y+m  7^  pat. 

We  may  therefore  try  a  match  at  a  later  position  by  shifting  the  pattern  sh(j,  tk+l)  positions  to  the 
right.  Sometimes,  sh(j,tk+i)  can  be  greater  than  one,  since  we  know  that  fext*_y+li(k  =  pot, >y.  In  fact 
we  can  define 

ah(j,tk+l)  :=  min{s  >  0;  fexft+,_y+,,t+l  =  pof,iy_.+  l}. 


-  s - ■>  1  P,  . 

• . . 

Pi-. 

Pi-.+  l 

1  Pi  •  ■  •  p«+l 

. . . 

Pi 

Pi+t  ... 

1  tk—j+l _  + 1 

. . . 

ft 

ft+l 

This  is  always  defined  since  if  s  =  j  +  1,  textk+2<k+i  and  pat,,0  arc  empty.  For  Mpat,  this 
corresponds  to  reading  f*+,  and  going  from  state  sy  to  state  sy_,+ ,  if  tk+l  ^  py+I.  It  is  possible  to 
build  the  shift  function  into  Mpat,  obtaining  a  DFA,  Mpat,  with  0(m)  states  and  0(m|E|)  edges. 

It  turns  out  that,  it  takes  time  0(m|E|)  time  to  construct  ah,  hence 

Theorem: 

We  can  determine  whether  pat  is  a  substring  of  text  in  time  0(m|E|  +  n). 

If  £  is  large  the  time  to  set  up  the  machine  might  be  significant.  This  leads  to  the  question:  is  there 
an  algorithm  that  is  independent  of  |E|? 


6.4.  The  Knuth-Morris-Pratt  Pattern  Matching  Algorithm. 

.  To  become  independent  of  |£|,  we  make  sh  independent  of  E  and  construct  a  new  machine  Mpat. 

sh'(j)  :=  min {ah(j,  a);  a  G  £  —  py+i}- 

Note  that  if  |E|  <  2  the  problem  is  silly.  Therefore, 

•h'(j)  :=  min{«  >  0;  pat.+iii  =  patltj~.  and  p,+,  ^  P,-,+i} 

If  j  <  •  then  pat,,,-,  and  pat,+ ,,y  are  empty  and  equality  holds.  If  a  =  j  +  1  then  both  conditions 
are  vacuously  satisfied.  This  means  that  0  <  ah'(j)  <  j  +  1.  If  we  build  ah'  into  Mpot,  we  might,  after 
going  from  state  «y  to  state  «y_„  still  have  a  mismatch  py_.+  ,  ^  tk+ ,.  In  this  case  it  is  necessary  to 
iterate  by  reducing  the  state  according  to  ah'.  This  gives  the  following  program: 
j  :=  0;  k  :=  0; 
while  j  <  m  and  fc  <  n  do 
while  j  >  0  ^  py+,  do 


6.5.  The  Doyer-Moore  String  Matching  Algorithm. 


31 


begin 

co  and„n4  means  that  the  second  operand  will  not  be  evaluated  unless  the  first  operand  is  true  oc; 

j  :=  j  -  ih'(j); 

k:=k  +  l;  j:=j+ 1; 

if  j  ==  m  then  “match  encountered" 

end. 

This  is  the  basic  Knuth-Morris-Pratt  algorithm.  It  can  be  refined  in  several  ways  For  details  s< 
[KMP77J. 

We  now  want  to  see  how  to  construct  the  shift  function.  The  basic  observation  is  that  if 
sh'(0),sh'( l ah'(j  —  1)  are  known,  it  is  easy  to  compute  Bh'(j).  For  the  following  progi 
to  compute  the  shift  function,  it  is  useful  to  recall  that 

ah'(j)  :=  min{s  >  0;  pat,,,-..  =  patJ+lly  and  pJ+1  ^  p,-,+  i} 
and  to  consider  the  following  figure: 

-  S  - *■  1  Pi  •••  Pi-t- 1  Pi-t  Pi-j+I 

_ = _ =  =  7^ 

L  _Pi  •••_  Ei± I  •••  Pi- 1  Pi  Py+i 

The  following  program  computes  sh'(j): 

co  we  shift  the  upper  copy  of  the  pattern  to  the  right  by  whatever  amount  is  appropriate  oc 
»h'(0)  :=  0;  *  :=  1;  j  :=  1; 

while  j  <  m  do 
begin 

while  a  <  j  undconrf  py  ^  p,_,  do 
a  ;=  g  +-  sh'(j  —  a—  1); 

if  Pj+i  —  Pj-  »+i  then  8h'[j) s  +  -  a)  else  8h'(j)  :=  a; 

j  j  +  1 

end. 

Whenever  the  inner  loop  of  the  algorithm  is  executed,  8  increases.  But  s  is  boundcu  from  above  by 
m  +  I .  The  variable  j  increases  once  each  time  the  outer  loop  is  executed  and  is  bounded  from  above 
by  m.  Therefore,  this  algorithm  has  time  complexity  O(m).  Similarly  it  holds  for  the  main  algorithm 
that  whenever  the  inner  loop  is  executed,  j  is  decreased.  But  j  is  always  at  least  -1  and  is  increased  at 
most  n  times  by  1.  Therefore  the  total  time  for  the  algorithm  is  0(m  +  n). 


6.5.  The  Boyer-Moorc  String  Matching  Algorithm. 

The  Boyer-Moore  algorithm  [BoM77]  uses  the  same  basic  idea  as  the  KMP  algorithm,  but  it  compares 
the  pattern  against  the  text  starting  at  the  right  end  of  the  pattern  and  is  thus  often  able  to  skip  more 
rapidly  over  a  part  of  the  text  that  can  not  possibly  contain  a  match.  Two  guiding  rules  arc  used  to 
dctcnninc  the  shift: 

a)  if  the  next  character  in  text  to  be  compared  is  t,  determine  the  rightmost  position  (default  0)  where 
t  occurs  in  pat\ 

b)  choose  a  >  1  minimal  such  that  : 
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It  can  be  proved  that  with  this  shifting  strategy,  the  Boycr-Moorc  algorithm  has  worst-case  complexity 
(if  no  occurrence  of  the  pattern  is  found)  of  O(n).  For  a  proof  see  [K.MP77,  GuO80], 

On  the  average,  the  algorithm  preforms  much  better.  In  this  case  the  average  is  based  on  the 
assumption  that  all  text  strings  of  a  given  length  are  equally  likely.  Assume  that  q  :=  |E|  is  rather  large, 
and  let  r  :=  [2  log,  mj.  Thus  the  probability  that  some  texti,-,+ 1,*  occurs  in  pat  is  at  most 

m  -  r  +  1 

because  there  are  qr  equally  likely  possibilities  for  textm_T+l  and  at  most  m  -  r  +  1  of  them  occur 
in  the  pattern.  If  a  match  occurs,  we  need  0(m)  steps  to  find  all  the  matches  in  positions  k  —  m  +  1 
through  k  —  r,  and  have  shifted  the  pattern  by  m  —  r.  If  there  is  no  match,  we  can  shift  the  pattern  by 
m-r  immediately.  Hence  the  average  time  complexity  is 


on  a  random  text 


6.6.  Space  Efficient  Linear  Pattern  Matching. 

The  pattern  matching  algorithms  discussed  so  far  use  auxiliary  space  of  size  n(m)  to  store  the  values 
of  the  shift  function.  We  now  investigate  a  string  matching  algorithm  (also  of  linear  time  complexity) 
that  needs  only  a  constant  amount  of  auxiliary  storage.  Contrary  to  the  KMP  algorithm  this  algorithm 
must  be  able  however,  to  read  the  input  tapes  (for  the  text  and  the  pattern)  in  both  directions.  The 
algorithm  is  based  on  properties  of  repeating  patterns  in  strings.  We  shall  first  give  some  definitions  and 
basic  lemmata,  and  we  assume  in  the  following  that  k  is  some  fixed  positive  integer  >  4. 

Definition: 

a)  A  string  a  is  a  period  of  a  string  w  if  w  is  a  prefix  of  some  a"  (i.c.,  iff  w  is  a  prefix  of  zw). 

b)  For  every  p  <  |to|,  we  set 

reachw(p)  :=  max{g;  wliP  is  a  period  of  w,^}. 

c)  A  string  z  is  basic  if  it  is  not  of  the  form  z1'  for  any  z’  and  *  >  1. 

d)  A  string  a  is  a  prefix  period  of  w  if  z  is  basic  and  zk  is  a  prefix  of  w  (i.e.,  reachw(\z\)  >  A|z|). 

Periodicity  Lemma:  If  \w\  >  pt  +  p*  and  w  has  periods  of  length  pt  and  ps  ^  p,  then  it  also  has  a 
period  of  length  gcd{pt,  p*). 

Proof:  Assume  without  loss  of  generality  that  ps  >  Pi-  As  wPj_Pl+Up,  =  wp,+itP,+Pl  and  u^.+i.M  = 
because  w  has  a  period  of  length  p i,  and  as  wPj+i,|«,|  =  because  w  has  a  period 

of  length  p,,  we  also  have  +i,h  =  an^  hence  w  has  a  period  of  length  p*  -  pt. 

Iterating  this  argument  as  in  Euclid's  algorithm  we  obtain  the  lemma.  | 

Corollary:  If  z  and  a*  are  prefix  periods  of  w,  with  \z’\  >  \z\,  then  \z'\  >  (k  -  l)|x|. 

Proof:  Assume  to  the  contrary  that  \z'\  <  (k  -  1)| z\.  Then  tizi  *|,|  has  periods  z  and  z'  and  length 
>  |z|  +  \z*\.  Hence  by  the  Periodicity  Lemma,  z  or  z'  cannot  be  basic.  But  this  contradicts  the  definition 
of  prefix  period.  | 


l 
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Decomposition  Theorem:  Each  pattern  pat  can  be  decomposed  pat  =  uv  such  that  v  has  at  most  one 
prefix  period  and  |u|  =  0(shiftv(\v\))  where 

shiftv(j)  :=  min{a  >  0;v,+iJ  =  for  j  =  l,...,jv|. 

Furthermore,  such  a  decomposition  can  be  found  in  time  O(m). 

We  postpone  the  proof  of  this  Decomposition  Theorem  for  the  moment  and  first  show  how  the 
space  efficient  string  matching  algorithm  works.  Basically,  it  checks  for  occurrences  of  the  pattern  suffix 
v  in  the  text,  and  whenever  it  finds  one  naively  checks  for  u  to  the  left  of  it  (here  the  algorithm  has  to 
back  up  the  reading  heads  on  the  input  tapes).  The  algorithm  uses  the  properties  of  v  in  shifting  the 
pattern  (suffix)  in  case  of  a  mismatch  by  an  amount  which  is  basically  a  fixed  portion  of  the  length  of 
the  matched  prefix. 

algorithm  pmatch(<ex<,  pat): 
begin 

co  A;  is  a  fixed  integer  >  4  oc; 

co  for  the  sake  of  simplicity  we  assume  that,  for  j  =  |v|  +  1,  v3  produces  a  character  which  does  not 
occur  at  all  in  text  oc; 

decompose  pat  =  uv  as  stated  in  the  Decompostion  Theorem; 

*  ~  M;  j  •=  0; 

while  i  <  n  —  |v|  and  j  <  |v|  do 

if  i,+J+l  ^  vJ+,  then 

begin 

case  v  has  a  prefix  period  (of  length  r)  of 
false:  begin  t  :=  i  +  rnax{l, \j/k]}\  j  0  end; 
true  :  if  kr  <  j  <  reachv(r )  then 

begin  » :=  i  +  r;  j  :=  j  —  r  end 

else 

begin  i  :=  »  +  max{l,  fj/A:]};  j  :=  0  end 

end 

else 

begin 

j  :=s  j  +  1; 

if  j  =  |w|  andend  text^ |„|+i,,  =  u  then 
“match  found  at  position  t  -  |u|  +  1” 

end 

end  pmatch. 

Proof  of  the  correctness  of  the  algorithm : 

Again  it  is  appropriate  to  imagine  that  the  pattern  (which,  in  this  case,  is  the  suffix  v  of  the  original 
pattern  pat)  slides  on  top  of  text  to  the  right.  If  tcxti+lii+j  matches  and  a  mismatch  occurs  in  the 
next  position  we  can,  without  possibly  missing  a  match  in  between,  shift  the  pattern  shiftv(j)  positions 
to  the  right  where  akiftv(j)  is  as  defined  in  the  Decomposition  Theorem: 

ahiftv(j)  :=  min{«  >  0;  v,+  lJ  = 

In  the  following  we  show  that  in  the  algorithm  pmatch  the  pattern  is  always  shifted  a  distance  < 
shiftv(j)  and  hence  that  no  occurrence  of  v  in  text  can  be  missed. 

Assume  first  that  u  has  no  prefix  period.  Then  the  algorithm  always  shifts  the  pattern  (in  case  of  a 
mismatch)  by  max{  I,  [>/*]}  where  j  is  the  length  of  the  prefix  of  v  which  matches  the  text  after  the 
position  given  by  the  current  value  of  t.  In  order  for  the  algorithm  to  be  correct  (in  this  case)  we  must 
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have  j/k  <  »hift„(j).  If  we  assume  to  the  contrary  that  k  >  k-  ahiftv(j)  then  clearly  (t>i, 
would  be  a  prefix  of  vti)  contradicting  the  assumption  chat  v  has  no  prefix  period. 

Now  assume  that  v  has  a  prefix  period  of  length  r  >  1.  If  kr  <  j  <  reach„(r)  then  certainly 
ahift,(j)  >  r  because  ahiftv(j)  <  r  would  imply  as  above  that  v  had  a  (second)  prefix  period  of 
length  <  8  hiftv(j)  <  r.  If  j  <  kr  then  ahiftv(j)  <  j/k  again  would  imply  a  second,  shorter  prefix 
period.  If  j  >  reachv(r)  assume  again  that  shiftv(j)  <  j/k.  Then  vt  <y)  is  a  period  of  vtJ. 
Hence  there  is  either  a  second  prefix  period  for  v,  or  j  <  reachv(r),  both  contradicting  our  assumptions. 

Hence  the  algorithm  is  correct.  | 

Analysis  of  the  running  lime  of  pmatch: 

As  long  as  we  do  not  count  the  time  for  the  decomposition  of  pat  and  for  the  checks  <ezt,_ |u|+i,i  = 
u,  the  integer  quantity 

(fc  +  1)»  + j 

increases  every  0(1)  steps  (this  is  clear  as  ( k  +  1)(»  +  fy/fc]}  >  (k  +  1)*  +  (k+  \)j/k  >  (k  +  1)»  +  j). 
But  this  quantity  is  bounded  by  (k  +  l)n  4-  m  =  0(n  +  m).  Each  time  an  occurrence  of  v  is  discovered 
in  the  text,  the  algorithm  naively  checks  for  u  to  the  left  of  it  using  additional  time  0(|u|)  (if  for  this 
test  it  is  necessary  to  first  reset  the  input  heads  |v|  positions  this  time  is  going  to  be  accounted  for  in 
the  0(n  4-  m)  time  to  find  all  occurrences  of  v).  But  v  can  be  found  at  most  n/ shift V(}v\)  times  in 
text.  As  |u|  =  0(ahi the  total  time  for  these  tests  is  0(n).  Together  with  the  bound  for  the 
decomposition  of  the  pattern  as  stated  in  the  Decomposition  Theorem,  we  obtain  the 

Theorem: 

The  algorithm  pmatch  finds  all  occurrences  of  pat  in  text  in  time  0(n  +  rn)  and  uses  only  a  fixed 
number  of  (auxiliary)  memory  cells. 

Let  us  finally  sketch  now  a  proof  for  the  Decomposition  Theorem. 

As  in  the  computation  of  the  shift  function  for  the  KMP  algorithm  we  match  pat  against  itself.  We 
actually  run  pmatch  with  pat  =  text  and  initially  v  =  pat  and  u  the  empty  string.  Whenever  we 
incur  a  mismatch  pJ+i  yA  p<+J+,  we  shift  the  upper  copy  of  pat  by  max{l,  \j/k\)  until  for  the  first 
time  j  —  ki  holds.  Up  till  (but  not  including)  then  no  prefix  period  has  been  found  so  the  shifting 
just  indicated  is  correct  as  we  have  argued  in  the  correctness  proof  for  pmatch.  When  j  reaches  ki 
for  the  first  time  then  z  patlfi  is  the  shortest  prefix  period  of  pat.  We  now  check  whether  pat 
has  a  second  prefix  period  (whose  length  then  must  be  >  (k  —  l)r,  by  the  Corollary  to  the  Periodicity 
Lemma).  Note  that  after »  +  j  reaches  reachpat(r),  j  gets  decreased  to  zero.  We  continue  pmatch  with 
the  shifting  done  as  stated  in  the  algorithm  for  the  case  where  the  pattern  has  a  prefix  period  of  length 
r  :=  |z|  until  for  the  second  time  j  equals  ( k  —  l)i.  If  instead  we  reach  the  end  of  pat  then  there  is  no 
second  prefix  period  and  we  can  set  v  :=  pat  and  ]wj  =  0.  Otherwise  the  length  of  this  second  prefix 
period  of  pat  must  be  greater  than  reachpat(r)  -  r,  by  the  Periodicity  Lemma.  We  now  cancel  the  initial 
[reac/i,.l(r)/rj  -  k  +  1  copies  of  z  from  both  copies  of  pat.  After  this,  z  is  no  longer  a  prefix  period 
of  the  remainder  pat’,  and  it  follows  again  from  the  Periodicity  Lemma  that  pat'  cannot  have  a  prefix 
period  shorter  than  reachpot(r)  —  r.  Hence  the  absolute  position  of  the  pointers  i  and  j  in  pat'  docs  not 
change  with  respect  to  pat  by  the  deletion  of  the  prefix.  Hence  we  can  iterate  the  above  process. 

Note  that  in  one  iteration  the  length  of  the  deleted  prefix  is  at  most  the  length  of  the  next  longer 
prefix  period.  Therefore,  the  proportion  of  the  total  length  of  the  cancelled  prefixes  to  the  length  of  the 
last  prefix  period  is  bounded  by  1  +  1  /k  +  1/fc*  + . . .  <  2.  As  certainly  8fit/fv(|v|)  is  not  less  than  the 
length  of  this  last  prefix  period  ( v  is  of  course  the  last  paf\)  we  obtain 

|u|  =  0(8/u'/fu(|v|)). 

Finally  we  remark  that  this  decomposition  algorithm,  for  the  same  reasons  as  the  main  algorithm 
pmatch,  runs  in  time  0{m)  and  uses  only  a  fixed  number  of  auxiliary  memory.  For  more  details  and  an 
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implementation  of  pmatch  on  multitape  Turing  machines,  which  runs  in  real-time,  the  reader  is  referred 
to  [GaS811. 


6.7.  Position  Trees. 

Another  way  to  solve  practically  all  of  the  problems  mentioned  in  6.1.  is  to  use  a  data  structure 
called  position  trees  ,  also  called  prefix ,  suffix,  or  bi-trec  [AHU74,  MaR80,  McC76,  Wei73].  Basically,  for 
every  position  t  in  text  the  shortest  prefix  p,  of  text,  n  is  determined  which  uniquely  identifies  position 
i.  This  means  that  whenever  text  is  decomposed  into  «p,i>  then  u  must  be  The  existence 

of  such  a  position  identifier  can  be  guaranteed  by  appending  to  text  a  special  endmarkcr  which  does 
not  occur  elsewhere.  The  position  identifiers  p,  are  then  stored  in  a  tree  whose  leaves  arc  in  one-to-one 
correspondence  to  the  positions  of  text ,  and  which  also  contains  non-tree  edges  (also  labelled  with 
elements  from  the  alphabet)  encoding  the  shift  function.  It  is  pretty  straightforward  (though  tedious)  to 
construct  position  trees  (respectively  a  compacted  variant  thereof)  in  linear  time  and  space  when  reading 
text  backwards.  One  of  the  references  also  gives  an  efficient  construction  for  the  case  when  text  has  to 
be  read  on-line  from  left  to  right.  For  more  details  sec  the  references  given  above. 


Chapter  7 


Searching  Graphs  and  Applications 


7.1.  The  Labelling  of  Trees. 

For  the  systematic  search  of  trees  it  is  often  useful  to  attach  to  the  nodes  numbers  or  labels  which 
give  some  information  about  the  position  of  a  node  in  die  tree.  Of  course  there  are  many  ways  to  specify 
positions  in  trees  or  graphs.  We  shall  discuss  the  following  standard  labellings  of  rooted  or  directed  trees 
(note  that  there  is  no  essential  difference:  a  directed  tree  is  necessarily  rooted,  and  a  rooted  tree  can,  in 
a  unique  way,  be  made  directed): 

-  preordcr  numbering; 

-  postordcr  numbering; 

-  inordcr  numbering  for  binary  trees; 

-  level  numbering; 

-  descendant  numbering. 

We  now  specify  these  numberings  in  turn,  by  giving  short  pieces  of  programs  which  generate  them. 
The  variables  and  data  structures  used  should  be  self-explicatory. 

(a)  procedure  preordcr(node:  v); 
begin 

num  :=  num  +  1;  pre[v]  :=  mim; 
for  all  w  €  sons[t)]  do  prcorderftu) 

end; 

num  :=  0;  preorder (root); 

Hence,  whenever  w  is  a  proper  descendant  of  v,  then  pre\w\  >  pre (»]. 

(b)  procedure  postorderfnodc:  v); 
begin 

for  all  w  6  aons[u]  do  postordcrftu); 
num  :=  num  +  1;  posl[v]  :=  num 

end; 

num  :=  0;  postorder(roo<); 

Hence,  whenever  tv  is  a  proper  descendant  of  v,  then  post[w ]  <  posf[v]. 

(c)  procedure  inorderfnodc:  v); 
begin 

if  v  has  a  left  son  w  then  inordeifu;); 
num  :=  num  +  1;  *n[v]  :=  num; 
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if  v  has  a  right  son  w'  then  inordcrfiu') 

end; 

num  :=  0;  inorder(roo0; 

Note  that  in  binary  trees  there  is  a  distinction  between  left  and  right  sons.  Let  w  be  a  descendant 
of  the  left  son  of  v,  and  w'  a  descendant  of  the  right  son.  Then  we  have 

tn[to]  <  m[v]  <  tnjw'J. 


(d)  procedure  lcvcl(node:  v); 
begin 

for  all  w  €  aona[v\  do 

begin 

lev[w\  :=  lev[v]  +  1;  lcvcl(io) 

end 

end; 

lev[root]  :=  1;  levcl(roof)  co  another  variant  sets  lev[root\  :=  0  oc; 

It  is  obvious  that  lev[v\  equals  the  number  of  nodes  on  the  (unique)  path  from  the  root  of  the  tree 
to  v. 

(c)  procedure  desccndants(nodc:  v)\ 
begin 

for  all  w  6  sons[u]  do  dcscendants(iu); 

<fes[o]  :=  1  co  v  is  considered  a  descendant  of  itself  oc; 
for  all  w  6  sons\v ]  do  des[vj  dea\v]  +  dea[w] 

end; 

dcsccndants(roof); 

Clearly,  des[o]  counts  the  number  of  descendants  of  the  node  v. 

There  are  many  (more  or  less  trivial)  relations  between  these  numberings.  As  an  example  we  state 
(and  leave  the  proof  as  an  easy  exercise)  the  following 

Descendants  Lemma;  Let  v  and  w  be  nodes  in  a  (rooted)  tree  and  assume  that  the  preorder  and 
postorder  procedures  visit  the  sons  of  every  node  in  the  same  order.  Then  the  following  four  conditions 
arc  equivalent; 

-  w  is  a  descendant  of  v ; 

-  frre\v]  <  pre[w J  <  pre[t>]  +  des[ v]; 

-  post[v\  —  des[v\  <  posf[w]  <  pos([v]; 

-  pre\v]  <  pre\w\  and  post\w)  <  post[v\. 

Hence  we  might  note  as  a  corollary  that  pre  and  post  together  uniquely  determine  the  structure  of 
the  tree. 


7.2.  Search  in  Graphs. 

Many  search  problems  on  graphs  contain  or  arc  special  variants  of  the  following  general  problem: 
’’Search  all  edges  of  a  graph  G  —  (V ,  li ),  number  the  nodes  in  some  order  from 
1  through  n  =  )K|,  and  find  a  spanning  forest  (i.c.,  a  set  of  disjoint  trees  with 
edges  from  li  which  contain  all  nodes  in  V)  for  the  graph.” 

For  this  general  problem,  we  give  the  following  skeleton  of  an  algorithm  (formulated  for  the  case  of 
an  undirected  graph;  the  generalization  to  digraphs  is  straightforward): 
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while  there  is  an  unnumbered  node  left  in  the  graph  do 

begin 

select  an  unnumbered  node  fc; 

select  (if  there  is  one)  an  edge  e  between  a  numbered  node  and  fc,  add  it  to  the  forest  and  perform 
actions_on_forest_edgcs(e); 

for  all  other  edges  e'  between  numbered  nodes  and  fc  do 

actions_on_non-forest_edges(e'); 

number  fc  with  the  next  available  number 

end. 


If  we  now  fill  in  the  particular  rules  for  the  selection  of  the  node  fc  and  the  edge  e,  we  obtain  a 
variety  of  graph  searching  methods.  Some  of  the  most  important  arc  listed  below,  together  with  their 
selection  criteria. 

BFS  (breadth- first- search): 

select  the  node  fc  and  the  edge  e  such  that  the  other  endpoint  of  e  has  the  lowest  possible  number. 
DFS  ( depth-first-search ): 

select  the  node  fc  and  the  edge  e  such  that  the  other  endpoint  of  e  has  the  highest  possible  number. 
TS  (topological  search): 

select  a  node  fc  with  a  minimal  number  of  edges  from  unnumbered  nodes;  select  c  arbitrarily. 

MCS  (maximum  cardinality  search): 

select  a  node  fc  with  a  maximal  number  of  edges  from  unnumbered  nodes;  select  e  arbitrarily. 

We  shall  discuss  applications  of  these  search  methods  for  graphs  in  the  sequel. 


7.3.  Connectivity. 

An  undirected  graph  G  =  (V ,  E)  is  connected  if  for  any  two  nodes  v,w  £  V  there  is  a  path  in  G 
from  v  to  w.  A  directed  graph  is  called  connected  if  its  undirected  variant  is  connected.  It  is  obvious 
how  to  use  DFS  or  BFS  in  order  to  determine  the  connected  components  (i.e.,  the  maximal  connected 
subgraphs)  of  a  graph. 

7.3. 1.  Biconnectivity. 

Assume  that  G  =  (V,  E)  is  a  connected  undirected  graph.  We  want  to  know  whether  G  can  become 
disconnected  if  we  just  remove  one  of  its  nodes  (and  the  edges  incident  on  that  node). 

Definition: 

A  node  a  £  V  is  called  an  articulation  point  of  G  if  there  are  nodes  v,w  £  V  different  from  a  such 
that  every  path  from  v  to  w  passes  through  a.  A  (connected  undirected)  graph  G  is  called  biconnecled 
if  it  has  no  articulation  point. 

Obviously  the  removal  of  an  articulation  point  from  a  graph  disconnects  the  graph.  More  general 
we  could  be  interested  in  how  many  nodes  it  takes  to  disconnect  a  graph.  The  minimal  number  for  this 
is  called  the  (node)  connectivity  of  the  graph.  Hence  a  connected  graph  with  no  articulation  point  has 
connectivity  >  2,  it  is  fifconnected. 

Lemma: 

(a)  G  is  biconncctcd  iff  any  two  distinct  of  its  edges  lie  on  a  common  simple  cycle.  i 

(b)  The  property  of  lying  on  a  common  simple  cycle  gives  rise  to  an  equivalence  relation  in  the  edge  set 
E.  Its  classes  are  called  biconnecled  components,  and  their  induced  subgraphs  blocks. 
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Proof:  The  proof  for  (b)  is  immediate,  and  the  proof  of  (a)  is  given  as  a  homework  problem. 

Lemma: 

Let  G,  =  [Vjt  Ej)  be  the  blocks  of  G. 

(a)  For  all  j,  |V,  n  V,|  <  1. 

(b)  A  node  a  is  an  articulation  point  iff  a  G  V*  ft  Vy,  for  some  i  ^  j. 

Proof:  (a)  Assume  that  for  some  x  ^  j  there  are  two  distinct  nodes  v  and  w  in  V,-  fl  Vy.  As  G,  and  Gy 
arc  both  connected  there  are  simple  paths  from  u  tow  wholly  within  G,  respectively  Gy  and  hence  edge 
disjoint  (remember  that  the  E,  form  a  partition  of  E).  But  it  is  obvious  how  to  construct  from  these  two 
paths  a  simple  cycle  containing  edges  both  from  £,  and  E}.  Contradiction. 

(b)  If  a  is  an  articulation  point  then  there  are  (necessarily  distinct)  v,w  such  that  every  path  from 
v  to  w  passes  through  a.  As  G  is  connected  there  is  at  least  one  such  simple  path.  Let  x  and  y  be 
the  two  nodes  on  this  path  next  to  a.  Then  the  edges  (x,a)  and  { y,a }  must  necessarily  be  in  two 
different  biconnectcd  components  because  otherwise,  by  the  previous  Lemma,  there  would  be  a  simple 
cycle  containing  these  two  edges,  and  a  could  not  possibly  be  an  articulation  point. 

On  the  other  hand,  if  a  G  Pin  Vy  for  some  i  ^  j,  then  there  are  edges  {x,  a}  G  E<  and  {a,  y}  G  Ej, 
and  every  path  from  x  to  y  has  to  pass  through  a  because  otherwise  there  would  be  a  simple  cycle 
containing  these  two  edges  which  would  contradict  the  fact  that  they  come  from  distinct  biconnectcd 
components.  But  then  a  is  an  articulation  point.  | 

We  can  think  of  a  DFS  being  implemented  recursively  in  the  same  way  as  the  preordcr  labelling 
routine,  and  we  say  that  a' DFS  visits  a  node  v  whenever  the  instance  of  the  recursive  search  procedure 
for  node  v  is  active.  . ' 

If  we  perform  a  dcpth-first-scarch  on  a  connected  undirected  graph  G  =  {V  ,E)  we  obviously  obtain 
a  spanning  forest  consisting  of  one  tree  T  =  (V,  E').  We  also  make  the  following  basic  observation 
about  the  non-tree  edges  in  E  -  E',  from  the  definition  of  DFS: 

-  Every  po'n-trcc  edge  {v,  w}  (encountered  when  visiting  v  in  the  DFS)  is  a  back-edge,  i.e.,  to  is  an 
ancestor  of  v  in  the  tree  constructed  so  far  (and  hence  also  in  T). 

ydTiis  fact  permits  us  the  following  notational  convention:  an  edge  {v,u»}  G  E  is  written  v  -*  w  if  it 
i?  a  tree  edge  and  w  is  a  descendant  of  v  in  the  tree,  and  it  is  written  v  w  if  it  is  a  back-edge  from  v  to 
,  its  ancestor  w.  Wc  also  write  v  w  for  the  (possibly  empty)  path  from  v  to  its  descendant  w  in  the  tree. 
Having  in  mind  that  all  non-tree  edges  of  G  arc  back-edges  and  that  the  removal  of  an  articulation 
/  disconnects  G  wc  immediately  obtain  the  following  characterization. 

Lemma: 

Let  G  =  {V,E )  be  a  connected  undirected  graph,  and  T  =  ( V,E ')  a  DFS-tree  of  G.  A  node 
a  G  V  is  an  articulation  point  of  C  iff  either 

(a)  a  is  the  root  of  T  and  has  more  than  one  son;  or 

(b)  there  is  a  son  s  of  o  such  that  no  descendant  of  a  (including  s  itself)  has  a  back-edge  to  a  proper 
ancestor  of  a. 

Proof:  It  is  clear  from  the  observation  stated  above  that  a  is  an  articulation  point  if  either  one  of  the 
two  conditions  in  the  Lemma  holds.  Now  assume  that  a  is  an  articulation  point  in  G.  If  o  is  the  root 
of  T  and  has  only  one  son  then  G  docs  not  become  disconnected  if  we  remove  a  and  its  incident  edges 
because  all  remaining  nodes  are  reachable  in  the  tree  from  this  one  son.  Also,  if  a  is  not  the  root  of  T 
and  every  son  of  a  has  a  descendant  with  a  back-edge  to  a  proper  ancestor  of  a  then  the  removal  of  a 
and  its  incident  edges  clearly  does  not  disconnect  G  and  a  is  not  an  articulation  point  | 

Hence,  if  wc  define  the  following  labeling  for  the  nodes  in  v  G  V 

(otu[u]  :=  min {pre\x\,pre[w]\  v  x  to}, 
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where  of  course  the  preorder  numbering  is  the  one  given  by  the  DFS,  we  obtain  the  following 
Corollary: 

A  node  a  different  from  the  root  is  an  articulation  point  iff  low[a\  >  prc[a )  for  a  son  s  of  a  in  T. 

It  is  immediate  to  transform  the  definition  of  low  into  the  following  local  form: 

iou>[i>]  =  min{pre[v],pre[u;],  /ou»[s];  v  w,  s  son  of  t»}, 

which  we  can  use  to  determine  the  biconnected  components  of  G. 

algorithm  biconnect; 
begin 

procedure  search(node:  v); 
begin 

num  :=  mim  +  1;  pre[v]  :=  num\  iou>[u]  :=  mim; 
for  all  edges  {v,w}  incident  on  v  do 
co  we  assume  that  all  these  edges  are  given  in  an  adjacency  list  of  v  oc 
if  pre\w]  =  0  then 

begin  co  w  hasn’t  been  visited  yet  in  the  DFS  oc 
push  {v,«;}  onto  stack; 
father[w]  v 

co  this  information  is  needed  to  distinguish  the  tree  edge  by  which  w  was  entered  oc; 
search(ie); 

if  fou)(u)j  >  pre[w]  then 
begin 

declare  v  an  articulation  point  if  it  is  not  the  root  of  the  DFS-tree  or  if  it  is  the  root  and  has 
at  least  two  sons; 

pop  all  edges  from  the  stack  up  to  and  including  {v,  w}  as  a  new  biconnected  component 
end; 

low\v]  :=  min{/<nu[v],  foto(to]} 

end 

else 

if  w  7^  father [v]  then  fo«j[v]  :=  min{foie[v],pre[u;]} 
od 

end  search; 

initialize  stack;  num  :=  0;  for  all  nodes  v  do  pre[v\  :=  0; 
search(root) 
end  biconnect. 

We  only  remark  tliat  the  algorithm  biconnect  as  it  is  formulated  automatically  determines  the 
biconnected  components  containing  edges  from  the  root  of  the  DFS-tree  because  the  condition  /ou>[u>)  > 
pre[root\  (=  1)  is  trivially  satisfied  whenever  in  search{root)  a  search(w)  for  a  son  w  of  the  root  has 
been  performed. 

Theorem: 

The  algorithm  biconnect  determines  the  biconnected  components  and  articulation  points  of  a  connected 
undirected  graph  G  —  (V,E)  in  time  0(jiJ|). 

Proof:  It  only  remains  to  verify  the  given  time  bound.  But  this  is  clear  as  the  DFS  visits  every  edge  at 
most  twice  (once  in  every  direction).  | 


7.3.  Connectivity. 
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Remark:  A  biconnected  component  consisting  of  a  single  edge  is  sometimes  called  a  bridge. 

7.3.2.  Strongly  Connected  Components. 

Let  now  G  =  (V,E)  be  a  directed  graph  (without  multiple  edges).  Call  two  nodes  v,w  £  V 
equivalent  if  there  is  'a  (directed)  path  in  G  from  v  to  w  and  one  from  w  to  v  (it  should  be  clear  that 
the  relation  among  the  nodes  of  G  such  defined  is  in  fact  an  equivalence  relation). 

Definition: 

The  subgraphs  induced  by  the  equivalence  classes  defined  by  the  above  relation  are  called  the  strongly 
connected  components  (SCC s)  of  G. 

Note  that  SCO’s  are  subgraphs  induced  by  subsets  of  the  node  set  of  the  (digraph)  G  (and  hence 
we  shall  say  that  we  have  determined  an  SCC  if  we  have  determined  its  node  set)  whereas  blocks  in 
undirected  graphs  are  given  by  subsets  of  the  edge  set  (and  hence  we  determined  blocks  by  determining 
biconnected  components). 

If  we  perform  a  DFS  (together  with  preordcr  numbering)  on  a  digraph  we  obtain  four  kinds  of  edges: 

-  tree  edges  which  are  part  of  the  spanning  forest  constructed  by  the  DFS; 

-  forward  edges  v  w  which  arc  non-tree  edges  with  pre\v\  <  pre\w\: 

-  back  edges  v  — ►  w  where  w  is  an  ancestor  of  v  in  the  DFS-forest  (and  hence  of  course  pre\w]  < 
pre[v ]); 

-  cross  edges  v  — ►  w  which  arc  edges  with  pre[w]  <  pre\v\  such  that  w  is  not  an  ancestor  of  t;  (but  has 
been  visited  before  in  the  DFS;  v  and  w  may  even  be  contained  in  different  trees  of  the  DFS-forest). 

Definition: 

Given  an  SCC  C  of  G  =  (V,E)  and  a  DFS-forest  of  G,  we  define  the  root  of  C  to  be  the  vertex 
r  of  C  with  minimal  preorder  label. 

It  is  immediate  from  the  definition  of  the  DFS  routine  that,  if  r  is  the  root  of  C ,  the  DFS  visits  all 
nodes  of  C  between  its  first  and  last  visit  to  r. 

Lemma: 

A  node  r  is  the  root  of  an  SCC  iff  there  is  no  back  edge  from  a  descendant  of  r  to  a  proper  ancestor 
of  r  and  if  there  is  no  cross  edge  from  a  descendant  of  r  to  a  node  w  such  that  the  root  of  tc/’s  SCC  is 
a  proper  ancestor  of  r. 

Proof:  The  proof  follows  immediately  from  the  above  remark  and  the  observation  that  whenever  one  of 
the  conditions  in  the  lemma  is  not  satisfied  a  node  in  r’s  SCC  different  from  r  has  been  visited  before 
by  the  DFS.  | 

Again  it  is  possible  to  define  a  labelling  of  G  which  captures  tire  criterion  of  the  Lemma: 

Definition: 


Lowlink[v\  :=  min{pre[wj;  w  can  be  reached  from  t>  via  zero 
or  more  tree  edges  possibly  followed  by  a  back 
edge  or  a  cross  edge  to.  a  node  the  root  of  whose 
SCC  is  an  ancestor  of  v}. 

Hence  the  above  Lemma  translates  into 

r  is  the  root  of  an  SCC  iff  Lowlink[r]  =  pre[rj. 
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And  wc  also  get  a  local,  recursive  formulation  of  the  definition  of  Lowlink: 

Lowlink\v)  :=  m'm{prejv],prc[w)l  Lowlink\a)\  a  is  a  son  of  v, 
v  — ►  w  is  a  back  edge,  or  t>  -+  w  is  a  cross  edge 
and  the  root  of  id’s  SCC  is  an  ancestor  of  v}. 


algorithm  see; 
begin 

procedure  search(nodc:  v); 
begin 

ntim  :=  num  +  1;  pre[v\  :=  num\  LoiWinfc(«]  :=  num; 
push  v  onto  stack; 
for  all  edges  v  -*  w  do 

co  we  assume  that  all  these  edges  are  given  in  an  adjacency  list  of  v  oc 
if  pre[w]  =  0  then 

begin  co  w  hasn’t  been  visited  before  oc 
search(w); 

Lowlink\v]  :=  m\n{Lowlink[v],  Lowlink[w}} 

end 

else 

if  0  <  pre[u;]  <  pre[v\  then 

co  we  set  pre[w ]  negative  as  soon  as  w's  SCC  has  been  completely  determined;  so  here  v  -*  w 
is  either  a  back  edge  or  a  cross  edge  with  w  still  on  the  stack  and  the  hence  the  root  of  id’s 
SCC  a  proper  ancestor  of  v  oc 
Loiufinfc[v]  :=  minlLowitnfclu],  pre\w\) 
od; 

if  pre\v]  =  Lowlink\v ]  then  co  v  is  the  root  of  a  new  SCC  oc 
pop  all  nodes  on  top  of  and  including  v  from  the  stack  and  negate  their  preorder  label 

end; 

initialize  stack;  num  :=  0;  for  all  nodes  v  do  prc\v)  :=  0; 
while  there  is  a  node  v  with  pre[v)  —  0  do  search(v) 
end  see. 

Note  that  the  while  loop  is  necessary  here  (different  from  the  case  where  the  DFS  was  applied  to 
a  connected  undirected  graph  in  the  algorithm  biconneci )  because  for  a  digraph  which  is  not  necessarily 
strongly  connected,  DFS  produces  a  spanning  forest  which  may  contain  more  than  one  tree. 

Theorem: 

The  algorithm  see  determines  the  SCC's  of  a  digraph  G  =  (V,  E)  in  time  0(\E\). 

Proof:  Again  the  correctness  of  the  algorithm  is  obvious  from  the  characterizations  given  before.  And 
the  time  bound  can  be  seen  from  the  fact  that  in  a  digraph,  DFS  visits  every  edge  exactly  once.  I 


7.4.  Planarity  Testing. 


7.4.1,  Planar  Graphs. 

A  finite  undirected  graph  G  =  (V,E)  is  called  planar  if  it  can  be  drawn  in  the  plane  (or  on 
the  surface  of  a  three-dimensional  sphere)  without  any  edges  crossing  one  another,  i.c.,  the  edges  are 


7.4.  PLANARITY  TESTING. 


43 


represented  by  connected,  finite  length  curves,  and  any  point  common  to  more  than  one  such  curve  must 
represent  a  node  of  G  incident  on  the  corresponding  edges.  A  drawing  of  a  graph  (in  the  plane  or  on 
the  sphere)  is  called  plane  if  it  satisfies  the  above  planarity  condition. 

Planar  graphs  have  been  studied  extensively  in  the  past,  both  for  theoretical  and  applied  purposes. 
We  only  refer  to  the  huge  amount  of  work  that  has  come  from  the  investigation  of  the  famous  Four 
Color  Problem  [BeW78].  There  is  also  a  strikingly  simple,  classical  characterization  of  planar  graphs  as 
given  in  the  following 

Theorem  (Kuratowski,  [Kur30]): 

A  graph  is  nonplanar  if  and  only  if  it  contains  a  homeomorphic  preimage  of  K&  or  K3t 3 . 

Remark:  Kh  is  the  complete  graph  with  5  nodes,  and  Kit 3  is  the  complete  bipartite  graph  with  3 
and  3  nodes: 


A  homcomorphism  is  a  continuous  (in  the  topological  sense)  mapping.  Here,  a  graph  is  considered  as 
a  manifold  consisting  of  finite  length  curve  segments  whose  intersections  exactly  represent  the  incidence 
relation  of  the  graph. 

For  a  proof  of  Kuratowski’s  Theorem  wc  refer  to  [Eve79]. 

Though  Kuratowski’s  criterion  is  very  simple  there  seems  to  be  no  obvious  way  to  turn  it  into  an 
efficient  algorithm  for  tcstinb  planarity  of  a  given  graph.  Before  we  develop  a,  in  fact  linear,  algorithm 
for  this  problem  let’s  first  consider  a  few  more  properties  of  planar  graphs. 

Every  drawing  of  a  graph  in  die  plane  (or  on  the  surface  of  a  sphere)  defines  (topologically)  connected 
components  which  arc  the  pieces  left  over  when  wc  cut  along  all  the  line  segments  representing  the  edges. 
For  a  plane  drawing  of  a  graph,  these  connected  components  are  called  faces.  Note  that  a  (finite)  plane 
graph  has  exactly  one  unbounded  face,  the  outer  face,  while  all  the  other,  inner  faces  are  bounded. 


Theorem  (Euler,  1736): 


Let  G  be  a  nonempty,  connected  plane  graph  with  n  nodes,  e  edges,  and  /  faces.  Then  the  following 
relation  holds: 


n  +  f  -e  =  2. 


Proof:  We  prove  the  formula  by  induction  on  e.  If  e  =  0  there  is  exactly  one  node  as  G  is  nonempty 
and  connected,  and  hence  there  is  exactly  one,  the  outer  face,  and  the  formula  holds.  Assume  the  formula 
holds  for  all  plane  graphs  with  less  than  e  edges,  for  some  e  >  0.  From  a  plane  graph  with  e  edges, 
take  away  one  edge  such  that  either  the  graph  remains  connected  or,  if  it  becomes  disconnected,  one 
component  consists  of  a  singleton  node.  This  can  always  be  achieved.  In  the  first  case  obviously  the 
number  of  nodes  remains  the  same  whereas  the  number  of  faces  decreases  by  one  (note  that  the  faces 
on  the  two  sides  of  the  deleted  edge  must  have  been  different;  otherwise,  as  a  face  is  (topologically) 
connected,  we  could  draw  a  closed  line  around  one  of  the  endpoints  of  the  deleted  edge  without  crossing 
any  edge  contradicting  the  fact  that  the  graph  is  still  connected),  and  by  induction  the  formula  holds.  In 
the  second  case,  wc  apply  the  induction  hypothesis  to  the  other  connected  component  left  after  the  edge 
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deletion.  If  we  add  to  it  the  deleted  edge  and  its  second  endpoint  we  do  not  change  the  number  of  faces 
while  increasing  the  number  of  nodes  and  the  number  of  edges  each  by  one.  Hence  the  formula  also 
holds  for  plane  graphs  with  e  edges,  completing  the  induction.  | 

As  a  consequence  of  Euler's  formula,  every  plane  drawing  of  a  planar  graph  has  the  same  number 
of  faces. 

Corollary: 

Every  planar  graph  without  self-loops  and  parallel  edges,  and  with  n  >  3  satisfies 

e  <  3n  —  6. 


Proof:  Each  side  of  an  edge  in  a  plane  drawing  of  a  graph  touches  only  one  face.  As  there  are  no  self-loops 
and  parallel  edges  the  boundary  of  each  face  is  formed  by  (one  side  of)  at  least  three  edges,  and  hence  we 
obtain  /  <  2e/3.  If  the  graph  is  not  connected  we  may  add  edges  without  destroying  its  planarity  such 
that  it  becomes  connected.  Hence  we  can  use  Euler’s  formula  and  obtain  e  =  n  +  /  —  2  <  n  -f  2e/3  —  2 
from  which  we  get  e  <  3n  —  6.  | 

Corollary: 

Every  planar  graph  without  self-loops  and  parallel  edges  has  a  node  with  degree  at  most  5. 

Proof:  If  we  assume  the  contrary  we  obtain  6n  <  2e  (every  edge  is  counted  twice,  once  for  each  endpoint) 
which  contradicts  e  <  3n  —  6.  | 

If  we  wish  to  test  planarity  of  graphs  we  may  actually  restrict  ourselves  to  biconncctcd  graphs.  As 
we  have  seen  in  one  of  the  homework  problems  the  articulation  points  of  a  graph  connect  its  blocks  in 
form  of  a  tree.  On  the  other  hand  it  is  always  possible  to  draw  a  planar  graph  in  the  plane  in  such 
a  way  that  a  specified  node  touches  the  outer  face  (the  easiest  way  to  see  this  is  to  look  at  a  “plane” 
drawing  of  the  graph  on  the  surface  of  the  sphere  where  all  faces  arc  bounded.  If  we  choose  an  interior 
point  of  a  face  adjacent  to  the  specified  node,  and  then  project,  by  a  central  projection  with  this  point 
as  center,  the  surface  of  the  sphere  onto  a  plane  tangent  to  the  sphere  on  the  opposite  side,  we  obtain  a 
plane  drawing  of  the  graph  with  the  specified  node  on  the  perimeter  of  the  drawing).  Hence,  if  we  have 
a  planar  drawing  of  a  block,  we  can  find  planar  drawings  (if  they  exist  at  all)  of  its  children  in  the  tree 
of  blocks  such  that  for  each  of  them  the  connecting  articulation  point  lies  on  the  outside,  and  we  can 
attach  these  drawings  to  the  drawing  of  the  articulation  points  in  die  father  block  and  still  obtain  a  plane 
drawing.  We  therefore  assume  in  the  sequel  that  G  =  (V,E)  is  a  biconnected  undirected  graph  without 
self-loops  and  parallel  edges  (which  also  could  be  added  later  to  a  plane  drawing). 

Let  C  =  (V',  E')  be  a  (simple)  cycle  in  G,  and  let  E"  be  E  n  ((K  —  V')  X  (V  -  V')). 

Definition: 

(a)  A  connected  component  of  (V  -  V',  E")  together  with  the  nodes  in  V'  linked  to  this  connected 
component  by  an  edge  in  E  —  E'  and  together  with  these  edges  is  called  a  bridge  of  G  with  respect 
to  C.  The  nodes  both  in  G  and  the  bridge  arc  called  the  attachments  of  the  bridge. 

(b)  An  edge  in  E  —  E'  with  both  endpoints  on  G  is  called  a  singular  bridge. 

Definition: 

Two  bridges  D  and  13'  (of  a  graph  G  with  respect  to  a  cycle  C  in  G)  interlace  if 
(a)  II  has  attachments  a,  b,  the  bridge  O'  has  attachments  c,  d,  all  four  arc  distinct,  and  they  appear  on 
G  in  the  order  a,  b,  c,  d\  or  if 
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(b)  B  and  B‘  have  at  ieast  three  attachments  in  common. 

Lemma: 

Let  Bi, . . .  ,Br  be  a  set  of  bridges  (with  respect  to  cycle  C)  such  that  no  two  of  them  interlace  and 
such  that  C  +  B,  (the  subgraph  of  G  given  by  the  nodes  in  C  or  Bt)  is  planar,  for  every  i  =  1, . . . ,  r. 
Then  C  +  Bt  + ...  +  B,  is  planar. 

Proof:  In  a  plane  drawing  clearly  every  bridge  must  be  drawn  completely  on  the  inside  or  the  outside 
of  (the  drawing  of)  C.  Let  the  nodes  on  C  be  in  order  t>,, . . . ,  vp,  vx  and  assume  inductively  that  the 
Lemma  holds  for  any  set  of  r  —  I  (non-interlacing)  bridges,  r  >  1.  As  the  bridges  Bit ...,Br  do  not 
interlace  we  can  find  a  bridge  By  such  that,  if  vt  is  the  lowest  and  vh  the  highest  (in  the  above  order  of 
the  nodes  on  C)  attachment  of  By  no  other  bridge  has  an  attachment  v{  with  l  <  i  <  h.  By  induction 
we  can  obtain  a  planar  drawing  of  C  +  B,  +  . . .  +  By_,  +  By+,  +  . . .  +  B,  (actually  with  all  these 
bridges  drawn  on  the  same  side  of  C )  and  also,  by  assumption,  a  plane  drawing  of  By  together  with 
the  edges  on  C  between  v,  and  «*.  If  l  h  we  can  “squeeze”  the  latter  into  the  one  bounded  face 
of  the  first  drawing  which  has  the  edges  of  C  between  v,  and  vh  at  its  boundary,  and  if  /  =  h  we  can 
insert  the  plane  drawing  of  By  in  any  (bounded)  face  with  v,  on  its  boundary  and  thus  obtain  a  plane 
representation  of  C  +  B,  +  . . .  +  Br.  Hence  the  induction  is  complete.  | 

Theorem: 

Let  Bi . Br  be  the  bridges  of  G  with  respect  to  C.  Then  G  is  planar  iff 

(a)  C  +  Bi  is  planar  for  every  t,  i  —  1, . . . ,  r;  and 

(b)  the  set  of  bridges  can  be  partitioned  into  two  subsets  such  that  no  two  bridges  in  the  same  subset 
interlace. 

Proof:  If  we  have  a  plane  drawing  of  G  the  partition  of  the  bridges  of  G  into  those  drawn  on  the 
inside  respectively  outside  of  C  clearly  satisries  the  condition  in  the  Theorem.  The  other  direction  is 
a  consequence  of  the  previous  Lemma  where  we  have  in  fact  shown  that  every  set  of  non-interlacing 
bridges  can  be  drawn  on  one  side  of  G.  1 

The  planarity  testing  algorithm  presented  in  the  next  section  makes  use  of  this  characterization.  It 
recursively  tests  whether  C  +  Bi  is  planar,  and  tries  to  partition  the  set  of  bridges  with  respect  to  C  into 
two  subsets  as  stated  in  the  Theorem.  In  order  to  find  C  and  the  bridges  a  DFS  is  used. 

7.4.2.  The  Hopcrofi-Tarjan  Planarity  Testing  Algorithm. 

Assume  that  DFS  is  applied  to  G,  and  that  the  nodes  are  renamed  such  that  v  —  pre\v\  for  every 
v  €  V.  We  also  assume  that  during  the  DFS  low[v]  (as  defined  in  7.3.1.)  has  been  computed.  For  the 
proper  parsing  of  G  we  use  still  another  labelling  of  the  nodes  given  by 

low2(v)  :=  min{v,  /ou>jtu);  w  ^  low[v),  v  x  w}. 

The  label  low 2[o]  essentially  gives  the  second  lowest  node  (but  at  least  v  itself)  reachable  from  v  along  a 
branch  in  the  DFS  tree  and  a  back-edge.  Using  these  auxiliary  functions  wc  assign  the  following  weight 
c(v,  w)  to  every  edge  (u,  w)  of  G  (the  edges  are  now  considered  directed  according  to  the  DFS): 

(2w  if  v  tt»; 

2low[w]  if  v  — *  w  and  low2[w]  >  v; 

2iou>|ti/]  +  1  if  v  -*  w  and  low2\w\  <  v; 

and  order  the  edges  in  the  adjacency  lists  of  G  (as  the  edges  arc  directed  now  each  edge  occurs  in  exactly 
one  such  list)  in  order  of  nondccrcasing  weight  Using  bucket  son  this  can  be  done  in  linear  time. 
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Wc  now  describe  an  algorithm  find-path  which  first  determines  a  cycle  C  of  G  through  the  root  of 
the  DFS  tree,  and  which  then  outputs  a  sequence  of  simple  paths  which  together  form  the  set  of  bridges 
of  G  with  respect  to  C.  Because  of  the  way  in  which  the  adjacency  lists  of  G  have  been  reordered  these 
paths  are  generated  in  a  order  which  will  help  us  to  make  efficient  use  of  the  planarity  criterion  of  the 
last  Theorem  in  the  previous  section. 

algorithm  find-path; 
begin 

co  we  are  given  a  DFS  tree  of  the  biconnected  graph  G  together  with  the  back-edges;  each  edge  of 
G  appears  (as  directed  edge)  in  exactly  one  adjacency  list,  and  the  adjacency  lists  are  reordered  as 
described  above;  the  nodes  are  named  1, ... ,n  in  the  order  they  were  visited  in  the  DFS  oc 
mark  all  edges  “new"; 
t,:=l;C:={l}; 

let  v  -+  u  be  the  first  (in  adjacency  list)  edge  leaving  v\  mark  it  “old”; 

while  ti  7^  1  do 

begin 

co  as  G  is  biconnected  and  because  of  the  reordering  of  the  adjacency  lists  wc  are  guaranteed  to 
reach  the  root  again  oc 
C  :=  C  U  {«};  v  :=  u; 
u  :=  endpoint  of  first  (“new")  edge  leaving  v; 
mark  v  — ►  u  “old” 
end; 

output  the  (simple)  cycle  C; 

co  note  v  now  is  the  highest  numbered  node  on  C  oc 
while  v  ^  1  do 

co  as  G  is  biconnected  there  is  only  one  tree  edge  from  the  root  oc 

if  there  is  a  “new”  edge  leaving  v  then 

begin 

u  :=  endpoint  of  first  “new"  edge  leaving  v\ 

P  :={»,«}; 

while  v  <  ti  co  v  -*  u  is  not  a  back-edge  oc  do 
begin 

v  :=  «; 

it  :=  endpoint  of  first  “new"  edge  leaving  v 
co  such  an  edge  exists  because  G  is  biconnected  oc; 

P  :=  P  U  {it} 
end; 

output  next  path  P 
end 

else  v  fathcr\v\ 
od 

end  find-path. 

It  is  clear  from  the  properties  of  the  DFS  tree  that  find-path  uses  time  0(1^1)  and  first  outputs 
a  simple  cycle  C  through  node  1  (the  root),  and  then  a  sequence  of  paths  P,  each  of  which  is  simple 
(because  it  consists  of  zero  or  more  "new”  tree  edges  followed  by  a  “new”  back-edge  leading  to  a  proper 
ancestor  of  P’s  first  node,  again  because  G  is  biconnected  and  also  because  there  arc  no  self-loops)  and 
has  exactly  two  nodes  in  common  with  previously  generated  paths,  namely  its  first  and  last  node.  Wc 
summarize  the  properties  of  find-path  which  wc  shall  use  in  the  following 
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Observations: 

(a)  Let  Pbc/-*»-»...  -*  l  be  a  path  generated  by  find-path.  If  v  ^  l  then  l  =  fotojv),  and  in 
every  case  l  is  the  lowest  node  reachable  from  f  via  a  path  of  the  form  f  v  w  where  v  w  has  not 
been  included  in  a  previous  path  (automatically  true  if  f  ^  v). 

(b)  Let  Pi  :  ft  l\,.Pj  :  ft  It  be  two  paths  such  that  Px  is  generated  before  Pt  by  find-path,  and 
assume  that  ft  ft,  i.e.,  that  ft  is  a  descendant  of  f\  in  the  DFS  tree  (possibly  fi  =  ft).  Then 
/,  <  If  This  is  an  immediate  consequence  of  the  fact  noted  above  that  the  endpoint  of  a  path  is 
always  the  lowest  point  reachable  via  “new”  tree  edges  followed  by  a  “new"  back-edge. 

(c)  The  following  property  makes  use  of  the  auxiliary  function  low2.  Let  Pt  :  /  — ►  v  -»  . . .  -*  l,  and 
Pt  :  f  -+  w  -*  ...  — »  1  be  two  paths  generated  by  / ind-path  with  the  same  first  and  last  node 
such  that  again  Pt  is  generated  before  Pt,  and  assume  that  o  ^  I  and  low2[v]  <  f.  Then  also 
w  7^  l  and  low2[w]  <  /.  The  reason  is  that  otherwise  /  -»  w  would  have  to  come  before  /  — *  v 
in  the  adjacency  list  of  f,  because  of  the  definition  of  the  low 2  function  and  the  reordering  of  the 
adjacency  lists. 

Lemma: 

Let  B  be  a  non-singular  bridge,  and  i  its  largest  attachment  Then  B  is  entered  via  a  tree  edge 
from  i,  completely  explored  before  find-path  backs  up  to  i  again,  and  all  other  attachments  of  B  are 
back-edges. 

Proof:  Because  every  path  constructed  by  find-path  terminates  as  soon  as  a  back-edge  is  encountered, 
certainly  no  node  of  B  which  is  not  an  attachment  is  visited  before  find-path  backs  up  to  i  and  traverses 
an  edge  i  -+  w  belonging  to  B.  As  B  is  non-singular  t  -*  w  cannot  be  a  back-edge  and  hence  must  be 
a  tree  edge.  As  B  is  connected  by  definition  the  DFS  explores  all  edges  belonging  to  it  before  backing 
up  to  i,  and  hence  all  the  other  attachments  of  B  must  be  back-edges.  | 

Lemma: 

Assume  we  have  a  plane  drawing  of  C  +  £?i  +  ...  +  £(_i,  where  Blt...,Bi- 1  are  the  bridges 
explored  by  / ind-path  so  far,  and  let  t  -♦  . . .  -*  j  be  the  first  path  of  the  next  bridge  B  =  B,  output 
by  find-path.  Then  B  cannot  be  added  to  the  inside  (resp.,  outside)  of  C  if  there  is  a  back-edge  of 
one  of  Bi,. ..,  Bt_,  drawn  inside  (resp.,  outside)  of  C  and  ending  in  a  node  k  with  j  <  k  <  i. 

Proof:  Assume  that  there  is  a  back-edge  ending  in  k  with  j  <  k  <  i,  and  let  this  back-edge  be  part 
of  Bt>,  l'  <  l.  Also,  let  i'  be  the  highest  numbered  attachment  of  Be.  Then,  because  of  the  previous 
Lemma,  we  have  i'  >  i.  If  i'  >  i  the  bridges  B,  and  Be  interlace  according  to  the  first  part  of  the 
definition  of  interlacing  as  j,i  are  attachments  of  Bt,  the  nodes  k,i'  are  attachments  of  Be,  all  four 
of  them  are  distinct  and  they  occur  on  C  in  the  order  j,k,i,i'.  On  the  other  hand,  if  i  =  i',  let 
-»  j'  be  the  first  path  of  Br  produced  by  find-path.  Then  j'  lies  on  C,  and  because  of 
observation  (b)  about  the  properties  of  find-path  we  know  that  j'  <  j.  Now  we  have  again  two  cases. 
If  j'  <  j  the  two  bridges  Bi  and  Be  interlace  as  before  by  the  first  part  of  the  definition.  If  however 
j'  =  j  we  note  that  Be  cannot  be  a  singular  bridge  because  it  also  has  the  attachment  k,  and  hence  part 
(c)  of  our  observations  about  find-path  applies  and  allows  us  to  conclude  that  the  new  bridge  Bi  must 
have  a  third  attachment  k'  with  j  <  k'  <  i.  Depending  now  on  whether  k  =  k'  or  not,  we  obtain  that 
B,  and  Be  interlace  according  to  the  second  respectively  first  part  of  the  definition. 

In  any  case  we  have  thus  shown  that  under  the  conditions  of  the  Lemma  the  new  bridge  Bi  would 
interlace  with  some  other  bridge  drawn  on  the  same  side  of  C  which  is  impossible.  | 

We  are  now  able  to  describe  the  final  algorithm  for  planarity  testing.  After  having  determined  the 
cycle  C,  it  proceeds  by  adding  to  a  plane  drawing  of  C  plane  drawings  the  bridges  in  the  order  in  which 
they  arc  explored  by  find-path.  As  at  any  point  the  bridges  drawn  inside  and  those  drawn  outside 
of  C  could  be  swapped,  die  algorithm  tries  to  draw  every  new  bridge  on  the  inside  of  C  after  having 
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flipped  bridges  drawn  earlier  and  standing  in  the  way  to  the  outside.  While  find-path  is  outputting 
paths  within  one  and  the  same  bridge  the  algorithm  checks  recursively  whether  this  bridge  together  with 
the  segment  of  C  between  its  lowest  and  highest  numbered  attachment  is  planar. 

The  previous  Lemma  gave  a  condition  for  bridges  which  cannot  be  drawn  on  the  same  side  of  C.  In 
general,  drawing  one  of  such  bridges  on  one  side  of  C  fixes  some  other  bridges  to  be  drawn  on  the  other 
side  of  C  or  again  on  the  same  side.  As  bridges  are  determined  by  the  back-edges  to  their  attachments 
we  can  formalize  this  situation  into  the  following 

Definition: 

A  block  of  back-edges  (to  attachments  on  C)  is  a  maximal  set  of  back-edges  such  that  putting  one 
of  the  back-edges  (and  hence  the  bridge  it  belongs  to)  on  one  side  of  C  implies  the  positions  of  all  the 
other  back-edges  (and  their  bridges). 

The  planarity  testing  algorithm  maintains  two  lists  5,  and  S0  with  the  back-edges  which  are  currently 
drawn  on  the  inside  respectively  the  outside  of  C.  The  elements  within  each  list  are  ordered  according 
to  nondecreasing  value  of  the  end  nodes  of  the  back-cdgcs  (actually  it  is  sufficient  to  keep  the  lists  of 
these  nodes  which  might  then  of  course  contain  repetitions).  Now  part  of  the  back-cdgcs  of  a  block  may 
be  contained  in  S,  and  part  in  Sa  but  within  each  of  the  two  lists  the  elements  of  a  block  appear  in  a 
nice  order. 

Lemma: 

Let  j  and  i  be  the  lowest  respectively  highest  numbered  attachment  of  back-edges  in  some  block.  If 
a  back-edge  has  attachment  k  with  j  <  k  <  i  then  this  back-edge  belongs  to  the  block. 

Proof:  We  are  going  to  prove  the  Lemma  by  induction  on  the  number  of  bridges  explored  at  any  moment. 
The  claim  is  certainly  true  as  long  as  at  most  one  bridge  (with  respect  to  (?)  has  been  explored  (assuming 
that  it  together  with  C  was  found  planar  by  a  recursive  application  of  the  algorithm)  because  then  all 
elements  in  5,  (Sc  is  empty)  belong  to  one  bridge  and  hence  one  block.  Now  assume  that  bridges 
Du. . . ,  B|_  i  have  been  explored  so  far  for  some  /  >  1,  and  that  the  Lemma  holds  at  the  moment  before 
the  first  path  of  the  next  bridge  B,  is  being  output  by  Jind-path.  Let  this  path  be  .  — »  j'.  As 

we  may  decide  arbitrarily  to  draw  B ,  at  this  moment  on  the  inside  of  C  (if  it  turns  out  to  be  planar  at 
all  itself)  the  previous  Lemma  tells  us  that  all  bridges  which  have  attachments  k  with  j'  <  k  <  »'  and 
are  currently  drawn  on  the  inside  of  C,  have  to  be  flipped  to  the  outside.  What  is  more,  all  blocks  with 
such  an  attachment  k  are  forced  to  be  drawn  on  the  other  side  than  B,.  Hence  we  can  combine  all  the 
back-edges  of  B,  and  all  edges  in  blocks  with  attachments  k  between  j'  and  i'  as  above  to  form  a  new 
block.  If  there  are  only  attachments  k  with  j'  <  k  <  V  from  back-edges  on  S„  their  position  clearly  is 
also  determined  by  the  position  of  B,,  and  they  belong  to  the  same  block  as  the  back-cdgcs  of  Bt.  Hence 
we  have  established  the  claim  of  the  Lemma  for  the  moment  after  the  exploration  of  one  more  bridge, 
and  thus  completed  the  induction.  | 

As  a  consequence  of  this  Lemma  we  note  that  the  back -edges  belonging  to  one  block  appear 
contiguously  on  the  lists  S,  and  S„.  As  we  have  to  be  able  to  (lip  whole  blocks  from  one  side  of  C  to 
the  other  whenever  there  occurs  a  conflict  with  the  first  path  of  a  new  bridge,  we  add,  for  each  pan  of  a 
block  on  one  of  the  two  lists  S,  and  Se,  a  pointer  from  the  last  element  of  the  block  in  the  list  to  its  first 
element  Using  these  pointers  it  is  possible  to  combine  the  sublists  of  two  adjacent  blocks  in  constant 
time  and  also  to  find  the  blocks  which  overlap  with  a  new  path. 

The  algorithm  is  now  straightforward.  Whenever  find-path  outputs  the  first  path  P  of  a  new 
bridge  we  determine  in  S,  and  S„  the  blocks  which  interfere  with  the  new  bridge  and  hence  have  to 
be  combined  with  it  into  a  new  block.  We  then  flip  all  blocks  in  S,  with  attachments  between  the  two 
endpoints  of  P  to  Sa.  If  after  this  swap  there  arc  again  conflicting  back-cdgcs  on  S,  (because  they  were 
flipped  in  from  S0)  we  stop  and  declare  the  graph  non-pianar.  Otherwise  we  insert  a  special  marker  in 
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S0  at  the  position  of  P's  starting  point  and  check  recursively  whether  the  new  bridge  together  with  the 
segment  of  C  between  the  two  endpoints  of  P  (let  us  call  die  cycle  consisting  of  P  and  this  segment  C'\ 
it  plays  the  role  of  C  in  the  recursive  step)  is  planar,  using  the  same  lists  St  and  S„.  After  the  algorithm 
has  completely  explored  the  new  bridge.jand  determined  that  it  is  planar)  we  still  have  to  make  sure  that 
all  bridges  of  this  new  bridge  with  respect  to  C'  which  have  attachments  to  the  segment  of  C  between 
the  two  endpoints  of  P  can  in  fact  be  drawn  on  the  inside  of  C'  (which  they  obviously  have  to  if  the  new 
bridge  is  to  be  drawn  on  the  inside  of  C).  This  can  be  done  by  flipping  all  those  blocks  with  back-edges 
in  S0  after  the  special  marker  introduced  before  the  recursive  step.  If  after  this  flipping  process  there  are 
still  (or  again)  back-edges  after  this  marker  we  again  stop  because  the  graph  is  non-planar.  Otherwise  the 
algorithm  proceeds  to  explore  the  next  bridge. 

Theorem: 

The  Hopcroft-Tatjan  path  addition  algorithm  tests  whether  a  graph  is  planar  in  time  0(\V\). 

Proof:  Clearly  the  initial  DFS  with  the  renaming  of  the  nodes  and  the  computation  of  the  auxiliary 
functions  takes  time  linear  in  \V\  if  we  check  during  the  search  that  in  fact  \E\  <  3|F|  (otherwise  we 
may  stop  immediately!).  Time  linear  in  |  V  (  is  also  sufficient  for  the  reordering  of  the  adjacency  lists 
(done  using  a  bucket  sort)  and  the  find-path  algorithm  because  it  is  essentially  another  DFS.  The  total 
number  of  elements  in  the  lists  5,  and  S0  is  bounded  by  the  number  of  back-edges,  and  hence  0(|F|). 
The  union  of  two  blocks  can  be  done  in  constant  time,  and  as  the  number  of  blocks  decreases  by  one 
after  each  such  union,  there  can  be  at  most  O(jVj)  such  operations.  Hence  we  obtain  a  total  time  of 
0(|V|).  | 


IS.  Shortest  Path  Problems. 

Suppose  we  are  given  a  (directed  or  undirected)  graph  G  =  (V,  E).  a  length  d(x,  y)  >  0  for  every 
edge  x  — *  y  (if  G  is  undirected  we  assume  that  d{x,y)  =  d(y,x)  for  all  edges;  we  also  assume  for 
notational  convenience  that  d[x,  y)  =  -t-oo  if  the  edge  x  y  is  not  present  in  G),  and  we  want  to  know 
the  length  dts[r,u>]  of  a  shortest  path  between  two  nodes  v  and  w.  Le„ 

t 

dt8[v,u/]  =  min{]T]  d(x)-i,xi);  v  =  z0  — »  Zi  -♦  . . .  -*  xr_t  -*  xr  =  w  is  a  path  in  G}. 
i- 1 

(Hence,  by  default,  dt8|v,ia]  =  oo  if  there  is  no  path  v  w.) 

Shortest  path  problems  are  usually  divided  into  the  following  categories: 

(a)  single  pair  shortest  path  problem :  find  the  distance  between  a  given  pair  of  nodes; 

(b)  single  source  shortest  paths  problem :  find  the  distance  between  a  given  node  and  all  other  nodes; 

(c)  all  pairs  shortest  paths  problem:  find  die  distance  between  every  pair  of  nodes. 

It  turns  out  that  no  algorithm  for  (a)  is  known  which  would  not  at  the  same  time  essentially  solve 
problem  (b). 

7.5.1.  Dijkstra's  Single  Source  Algorithm. 

We  construct  a  bigger  and  bigger  set  $  of  nodes  whose  distance  in  G  from  some  given  node  a  is 
known. 

algorithm  singlc_sourcc_shortcsl_path; 

co  determine  the  distance  from  node  a  to  all  v  G  V;  the  graph  G  =  {V,E)  is  given  by  adjacency  lists; 
d[v,ta]  >  0  is  tine  length  of  edge  v  -*  w  (+oo  if  not  present)  oc 
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begin 

5  :=  {a};  rftsfaj  :=  0; 

for  all  v  £  V  —  {a}  do  dtsfv]  :=  d[a,v\‘, 

while  |S|  <  | V |  do 

begin 

determine  v  6  V  -  S  with  minimal  dts[v]; 

S:=SU{»}; 

for  all  ( v ,  w)  with  w  £  V  —  S  do 
dis[w ]  :=  min{dis[io],dta[v]  +  d[v,  to]} 

end 

end  single_source_shortest_path. 

Theorem: 

The  algorithm  singlesourceshortest-path  correctly  determines  the  distance  from  a  to  every 
v  £  V,  in  time  0(|F|2). 

Proof:  We  only  have  to  verify  the  correctness  of  the  algorithm,  the  time  bound  is  immediate. 

Whenever  |S|  <  IVJ  is  checked,  disju]  is  the  distance  from  a  to  v  for  all  v  £  S.  This  is  certainly 
true  before  the  first  execution  of  the  loop.  Suppose  now  that  just  bcfo:e  some  later  execution,  v  €  V  —  S 
is  such  that  dis\v J  is  minimal,  assume  that 

—*  Vq  -*  V 

is  a  shortest  path  from  a  to  v  with  6  S  and  u,  S  (note  that  r  =  0  and  q  =  0  are 

possible),  and  also  assume  now  by  contradiction  that  this  path  is  strictly  shorter  than  dis\v].  But, 
if  q  >  0,  the  length  of  a  -*  a,  -♦  . . .  -*  v,  is  at  most  the  length  of  the  whole  path  and  hence 
<  tfts[w,]  <  dis[v],  contradicting  the  choice  of  v.  Hence  q  must  be  zero  and  tfis[v]  is  correct  because 
it  was  set  to  dtsjs,]  +  <£[»,,«]  (resp.,  d[a,  v],  if  r  =  0)  in  some  earlier  traversal  of  the  inner  loop  and 
because  dt8[sr]  was  then  correct  by  the  induction  hypothesis.  | 

Remarks: 

(a)  Using  appropriate  data  structures  (priority  queues  with  update  operations)  we  can  implement 
algorithm  singlesourceshortest-path  to  run  in  time  0(|E| log|V|)  or  0(fc|£|  +  fc|V|1+l/*) 
for  any  positive  integer  k. 

(b)  A  similar  algorithm  (though  with  running  time  0(|V|3)  is  possible  for  the  single  source  shortest 
path  problem  where  we  allow  arbitrary  edge  lengths  but  no  negative  length  cycles  (sec  homework 
problem). 

For  more  information  on  these  extensions  see  (Joh73J. 

7.5.2.  The  AH  Pairs  Shortest  Paths  Problem 

Now  suppose  we  want  to  compute  the  length  of  a  shortest  path  between  all  pairs  of  vertices.  Let 

. . vn  be  the  nodes  of  G.  Suppose  the  distance  between  x  and  y,  d[x,y )  satisfies  d\x,y]  >  0. 

Define  cl**  to  be  the  length  of  a  shortest  path  from  v,  to  v}-  containing  as  internal  nodes  only  tv  with 
k'  <  k.  A  good  algorithm  to  solve  this  problem  (especially  if  the  graph  is  dense)  is  Floyd’s  algorithm: 
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begin 

for  all  (t,j)  do  :=  d[i,j\  co  0  if  t  =  j  oc; 
for  k  :=  1  to  n  do 
for  all  (i,j)  do 


(k)  .  r  (*-l)  (*-l)  .  (Ik  — lli 

minlc-j  \c|*  +  «*>  '} 


end. 


To  see  that  this  algorithm  works  it  is  necessary  to  observe  that  every  shortest  path  is  simple.  It  is 
then  an  easy  induction  on  k  to  show  that  ej**  is  the  length  of  a  shortest  path  from  i  to  j  with  internal 
nodes  only  vk>  with  k'  <  k.  If  vk  does  not  appear  on  the  shortest  path  from  i  to  j  containing  as  internal 
nodes  only  vk,  with  k'  <  k  then  e-J*  =  Otherwise,  vk  appears  on  the  path  exactly  once. 

The  algorithm  clearly  requires  0(|V|3)  time. 


7.5.3.  Min-Plus  Transitive  Closure. 


The  all  pairs  shortest  path  problem  can  also  be  solved  using  matrix  methods.  It  is  possible  to  define 
a  matrix  product  with  other  operations  in  place  of  addition  and  multiplication.  Specifically  we  will  be 
interested  in  what  results  when  addition  is  replaced  by  the  minimum  operation  and  multiplication  is 
replaced  by  addition,  i.e.,  if  C  —  AB  then  =  min* {a,*  +  bk]).  If  A  gives  the  length  of  edges 
in  a  graph,  i.  e.  ai}  =  d[v,,  v3\  between  nodes  t>,  and  r,,  then  A2  using  the  Min-Plus  product 
gives  the  minimal  distance  along  paths  consisting  of  two  edges,  A3  three  edges,  and  so  on.  Setting 
A‘  =  min.^o^'  turns  out  to  be  well  defined.  Here,  A0  is  the  matrix  with  0  on  the  diagonal  and  +oo 
everywhere  else.  It  functions  as  an  identity  matrix  with  respect  to  this  product.  The  other  powers  are 
defined  by  Am+1  =  AAm  for  m  >  0.  This  matrix,  A" ,  is  called  the  (Min-Plus)  transitive  closure  of  A. 
It  is  the  solution  to  the  all  pairs  shortest  paths  problem,  if  A  is  the  distance  matrix.  We  shail  later  prove 
that  the  complexity  of  transitive  closure  is  the  same  as  that  of  matrix  multiplication  (to  within  a  constant 
factor).  It  is  therefore  interesting  to  look  for  good  implementations  of  Min-Plus  matrix  multiplication. 
It  should  be  noted  that  the  o(n3)  algorithms  of  Strassen  and  others  for  Plus-Times  matrix  multiplication 
will  not  work  because  the  min  operation  docs  not  have  a  well  defined  inverse. 

There  is,  however,  an  0(n5/2)  algorithm  for  this  problem  using  a  different  model  of  computation. 
This  is  the  decision  tree  model.  In  this  model  different  code  may  be  executed  depending  on  the  results 
of  each  comparison.  This  code  will  depend  on  n,  the  size  of  the  matrix  given  by  the  number  of  rows  or 
columns  it  has.  The  <9(n5/2)  algorithm  is  too  complicated  to  be  discussed  here,  so  an  0(n5,'2(logn)l/*) 
algorithm  will  be  discussed  instead.  Both  algorithms  are  described  in  more  detail  in  [Fre76], 

We  wish  to  compute  uie  Min-Plus  product  AIJ,  where  A  =  (ai})  and  B  =  We  divide  A 
into  n  X  m  submatrices,  A.  The  value  of  m  will  be  determined  later.  Similarly  we  divide  B  into 
m  X  n  submatrices,  £?,.  The  product,  AB  is  min(vd1S1, . . .  ,An/mBn/m).  If  the  A,B>  have  somehow 
been  computed,  computing  AB  takes  n3/m  time  since  there  are  n/m  matrices  in  the  minimum  and 
processing  each  one  requires  0(n 2)  time. 

To  compute  A\Bl%  for  example,  we  consider  each  1  <  r  <  s  <  m.  For  each  such  pair  we  sort  the 
2 n  differences  n„  —  a„  and  bti  —  brj,  for  1  <  t  <  n  and  1  <  j  <  n.  For  all  pairs  of  r  and  s  this 
takes  0(m2nlog(n))  time.  Computing  an  entry,  c„  of  AtB ,  is  the  same  as  computing  the  t{i,j)  such 
that  a,'(  +  btj  is  minimum.  Since  air  +  6,y  <  a*,  -f  b ,,  if  and  only  if  air  -  ait  <  b,j  —  br],  once  the  sort 
is  done  determining  t(i,  j)  can  be  done  in  constant  time  in  the  decision  tree  model  since  it  is  completely 
determined  by  the  result  of  the  sort. 

Therefore  computing  the  A<Bt  requires  0(mn2log(n))  time.  If  m  =  n1/2/(log(n))1/2  the  whole 
algorithm  takes  C>(ns/2(logn)1''2)  time.  If  n  is  sufficiently  large  it  is  possible  to  precompute  small 
decision  trees.  This  results  in  an  <7(n3(loglogn/logn)l/3)  time  algorithm. 

This  algorithm  also  works  for^thc  boolean,  Or-Acd.  matrix  product. 

Yao  and  others  have  proved  that  under  the  decision  tree  model  at  least  fl(n2  logn)  time  is  required 
to  compute  the  transitive  closure.  [YAR77] 
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7.5.4.  Boolean  Matrix  Multiplication,  Transitive  Closure 

We  now  want  to  prove  the  equivalence  of  matrix  multiplication  and  transitive  closure  as  far  as 
computational  complexity  is  concerned.  We  will  consider  the  boolean  Or-And  product  here.  The  proof 
will  also  work  for  the  Min-Plus  product  discussed  earlier.  To  aid  notation  we  will  denote  the  additive 
operation  by  “V"  anu  the  multiplicative  one  by  “A”.  The  Or-And  product  is  then  c,3  =  V*a<*  A  bkJ 
for  the  t.i  jduct  C  =  AD.  To  get  an  intuitive  idea  of  what  is  going  on  it  is  useful  to  suppose  that  the 
matrices  arc  adjacency  matrices.  Let  A'  be  the  transitive  closure  of  A.  Then  intuitively,  ai3  =  1  if  and 
only  if  there  is  a  path  from  t  to  j  in  the  graph. 

Let  T(n)  be  the  time  to  compute  the  transitive  closure  of  n  X  n  matrices.  Let  M(n)  be  the  time  to 
compute  the  boolean  product  of  two  n  X  n  matrices. 

Theorem: 

If  T(3n)  <  cT(n)  and  M(2n)  >  4Af(n)  then  T(n)  =  6(M(n)). 

The  assumptions  are  reasonable  since  the  first  one  is  satisfied  if  the  transitive  closure  can  be  computed 
in  polynomial  time  and  the  second  is  satisfied  if  matrix  multiplication  takes  time  0(n2). 

Proof:  First  we  reduce  M,  matrix  multiplication,  to  T ,  transitive  closure.  Suppose  we  want  to  compute 
C  =  AB.  Let 


L  = 


0  A  0\ 
0  0  2?. 
0  0  0  / 


Intuitively,  this  corresponds  to  a  tripartite  graph  with  nodes  i,,  y,,z,  for  1  <  t  <  n.  There  is  an  edge 
from  x,  to  y3  if  and  only  if  at]  =  1  and  an  edge  from  y,  to  z3  if  and  only  if  btj  =  1.  There  are  no 
other  edges.  It  is  easy  to  see  that 


(l  A  AB\ 
L*  =  1  0  J  B  )• 
loo  I  J 


Therefore  M(n)  <  T(3n)  =  0(T(n)). 

Suppose  now  we  want  to  compute  L‘ .  Without  loss  of  generality  wc  can  assume  that  n  is  a  power 


of  2.  Subdivide 

b\ 

\c  D) 

Let 

L*~  (E  F\ 

l  G  II) 

It  is  then  easy  to  verify  that 

E  =  (A  V  BD'C)* 

F  =  EBD\ 
G  =  D'CE, 


H  =  D‘  V  GF. 


Consider  for  example  the  first  equation.  Think  of  L  as  an  adjacency  matrix  of  a  graph.  Partition  the 
nodes  into  two  sets  X  and  Y ,  so  that  A  is  the  adjacency  matrix  for  the  subgraph  induced  by  X .  Consider 
a  path  between  two  nodes  in  X.  Partition  the  path  every  time  it  touches  a  node  in  X.  Bach  piece  will 
cither  be  an  edge  between  two  nodes  in  X  or  will  go  from  a  node  in  X  to  a  node  in  Y.  possibly  go  to 
some  other  nodes  in  V  and  go  back  to  A".  In  the  first  case  the  path  is  represented  by  an  entry  in  A. 
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In  the  second  case  it  is  represented  by  an  entry  in  BD'C.  The  transitive  closure  of  (A  v  BD'C)  then 
represents  all  paths  between  elements  of  X.  Therefore,  E  =  (A  V  BD'C)' ,  which  is  what  we  wanted. 

To  find  L ‘  two  transitive  closures,  six  matrix  multiplications,  and  two  componentwise  V’s  all  on 
matrices  of  size  n/2  are  needed.  Therefore 

T(n)  <  2 T(£)  +  6 Af(J)  +  c'n*. 

It  is  then  possible  to  prove  that  T(n)  <  cM(n)  by  induction.  The  basis  is  obvious  if  c  is  large  enough. 
By  the  inductive  hypothesis, 

T(n)  <  2cA/(  J)  +  6 M( J)  +  c'n*  <  ^(2c  +  6  +  4c')Af(n), 

2  2  4 

since  M(n)  >  nJ.  The  induction  goes  through  if  c  >  c/2  +  3/2  +  c'  or  e  >  3  +  2c\  | 

7.5.5.  The  Four  Russians'  Algorithm  for  Boolean  Matrix  Multiplication. 

We  are  now  interested  in  computing  the  boolean  matrix  product.  The  method  of  section  7.5.3.  will 
work.  Warshall  published  one  of  the  earliest  algorithms  to  solve  this  problem  [War62J.  It  is  also  possible 
to  consider  the  boolean  matrices  to  be  integer  matrices  and  use  a  Plus-Times  product  algorithm.  It  will 
then  be  necessary  to  remember  that  any  nonzero  entry  is  really  one.  Another  interesting  method  that 
lends  itself  to  vector  operations  was  published  in  [ADK70].  Suppose  we  want  to  compute  C  =  AB.  As 
with  the  Min-Plus  product  we  divide  A  into  n  X  m  submatrices  and  B  into  m  X  n  submatrices.  This 
time  we  take  m  :=  n/[lognJ.  We  assume  that  m|n.  Define  C,  =  A<B,. 

Then  C  =  Vi<lgm  Ci  and  it  takes  0{n2m)  to  compute  this.  Each  row  of  A,  has  [IognJ  elements 
and  Bt  has  that  many  rows.  Each  row  of  C,  is  a  boolean  combination  of  the  rows  of  Bit  given  by  a  row 
of  A,.  For  notational  convenience  consider  all  matrices  to  be  column  vectors  of  boolean  row  vectors.  It 
is  possible  to  compute  all  of  the  possible  boolean  combinations  of  the  rows  of  B,  in  a  reasonable  amount 
of  time.  The  following  procedure  docs  just  that: 

procedure  bcomb(intcger:  t); 
begin 

comfcjO]  :=  |0, . . . ,  0]; 

for  j  :=  1  to  2llo*nJ  -  1  do 

begin 

p  :=  [logjJ; 

comb\j]  :=  comb[j  -  2”}  V  &(.-i)[iot  nj+P+i 

end 

end; 


This  procedure  requires  0(n2)  time.  It  is  used  in  the  final  algorithm: 
algorithm  Four-Russians(array:  a,  b,  c); 

begin  co  we  assume  that  the  matrices  a,  b,  c  are  organized  as  vectors  of  rows  oc; 
const  l  =  [log  nJ ; 

var  comb:  array(0..2*  -  1]  of  boolean-vector; 
for  i 1  to  n  do  c[t)  :=  [0, . . . ,  0); 
for »  :=  1  to  n  div  l  do  co  wc  assume  that  /|n  oc 
begin 
bcomb(t); 
for  j  :=  1  to  n  do 
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begin 
nc  :=  0; 

for  k  l  downto  1  do 

if  a\j,  Jfc]  then  nc  :=  nc  +  nc  +  1  else  nc  :=  nc  +  nc; 
c[j)  :=  c[j]  V  comb(ncj 

end 

end 

end  Four-Russians. 

The  procedure  bcomb  is  called  m  times.  The  inner  loop  is  executed  nm  times.  Each  iteration 
requires  O(logn)  time  to  compute  nc  and  0(n)  to  do  the  vector  “V”.  Therefore  the  total  time  is 
0(n3/[lognJ).  If  we  count  n-bit  vector  operations  as  talcing  unit  time,  then  bcomb  requires  only  0(n) 
time.  The  vector  “V”  require  only  constant  time  each.  It  is  also  possible  to  read  the  value  of  nc  as 
consecutive  bits.  So  the  algorithm  requires  0(na/[lognJ)  vector  operations. 

7.6.  References. 

Eve  and  Kurki-Suonio  [EvK77]  have  another  algorithm  for  determining  the  transitive  closure  that 
finds  the  strongly  connected  components.  Their  algorithm  has  worst  complexity  0(|F|3),  but  is  useful 
for  sparse  graphs.  An  algorithm  with  average  time  complexity  0(\V\2  log|V|)  is  described  in  IBFM76J, 
and  one  with  linear  expected  time  in  [Sch78]. 
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Chapter  8 


Maximum  Matchings  in  Graphs 


8.1.  Fundamentals. 

In  the  following  we  assume  that  G  =  (V,E)  is  an  undirected,  connected  graph  with  no  self-loops 
and  no  parallel  edges.  This  is  no  severe  restriction  because  self-loops  and  parallel  edges  cannot  be  part 
of  matchings  as  will  become  clear  from  the  definitions  below,  and  as  the  problem  for  several  connected 
components  can  be  solved  by  treating  each  connected  component  separately. 

Definition: 

(a)  A  set  M  C  E  is  called  a  matching  if  no  two  edges  in  M  have  a  node  in  common. 

(b)  A  matching  M  is  called  perfect  if  |V|  is  even  and  \M\  =  |V|/2. 

(c)  A  matching  M  is  called  maximum  if  there  is  no  matching  M'  C  E  with  |Af'|  >  |M|. 

Remark: 

A  perfect  matching  is  also  called  a  l-faclor. 

Definition: 

A  (simple)  path 

t>l— Vj— . . .  —vik 

in  G  is  called  an  augmenting  path  with  respect  to  a  matching  M  if  v\  and  w**  are  free,  i.e.,  not  incident 
to  any  edge  in  M,  and  exactly  every  other  edge  of  the  path  is  in  M. 

There  are  some  classical  characterizations  for  whether  a  matching  is  maximum  or  whether  a  perfect 
matching  (which  is  clearly  maximum)  exists: 

Marriage  Theorem  (Frobcnius-Konig-Hall-Rado): 

Let  E  C  Vj  X  Vj  be  a  relation.  E  contains  a  matching  of  size  |F,|  (and  hence,  if  IVJ  =  |VS|,  a 
perfect  matching)  iff 

|E(K)l  >  \V\  for  every  V  C  Vt. 


Theorem  (Tuttc  1947): 

A  graph  G  =  ( V,E )  has  a  perfect  matching  iff  iKj  is  even  and  there  is  no  S  C  V  such  that  the 
number  of  odd  size  components  (i.e.,  the  number  of  connected  components  having  an  odd  number  of 
vertices)  of  the  subgraph  induced  by  V  -  S,  is  greater  than  |S|. 
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Theorem  (Berge  1957): 

A  matching  is  maximum  iff  there  is  no  augmenting  path  with  respect  to  it 

We  are  not  going  to  prove  the  first  two  theorems  but  refer  the  reader  to  [HaM8]  and  [Tut47].  A 
proof  for  Bcrge's  Theorem  will  follow  as  a  corollary  of  one  of  the  lemmas  presented  in  the  sequel. 

If  S,  T  C  E  we  denote  by  S  0  T  the  symmetric  difference 

S  ®  T  =  (S  -  T)  U  (T  -  S). 

Lemma: 

Let  M  be  a  matching,  and  let  P  be  an  augmenting  path  with  respect  to  M  (here  we  look  at  P  as 
the  set  of  its  edges).  Then  M  g)  P  is  a  matching,  and  |M  ®  P\  =  \M\  +  1. 

Proof:  The  second,  fourth,  ...  etc.  edge  of  P  is  an  element  of  M.  Hence,  as  the  first  and  last  node  of 
P  are  free,  it  is  clear  that  M  ®  P  is  again  a  matching.  And  as  P  contains  one  more  edge  not  in  M 
than  edges  in  M,  the  size  increases  by  one.  | 

Lemma: 

Let  M,  N  be  matchings,  and  let  |7Vj  >  \M\.  Then  N  ®  M  contains  at  least  |A7|  -  \M\ 
vertex-disjoint  augmenting  paths  with  respect  to  M. 

Proof:  In  the  graph  C  =  (V,  N  ®  M)  every  node  has  degree  at  most  two  because  N  and  M  are 
matchings.  Hence  every  connected  component  of  G'  is  either 

a)  an  isolated  vertex,  or 

b)  a  (simple)  cycle  with  edges  alternatingly  in  M  -  N  and  N  -  M  (note  that  such  a  cycle  must  have 
even  length),  or 

c)  a  (simple)  path  with  edges  alternatingly  in  M  -  N  and  N  -  M. 

Now  let  Ci,...,CT  be  the  connected  components  of  G'.  It  is  clear  from  the  definition  of  the 

symmetric  difference  that  N  =  M  0  Cj  <g> . . .  0  C,.  Let  A/0  be  M,  and  set,  for  t  =  1 . r, 

Mi  :=  Mi-i  &  Cj.  But  in  the  sequence  |Af0|,  |M,|, ,  \M,\  only  connected  components  in  category 
c)  and  among  them  only  those  which  start  and  end  with  an  edge  not  in  M  (and  hence  have  odd  length) 
increase  tire  number  of  matched  edges.  Now  such  paths  are  in  fact  augmenting  paths  with  respect  to  M 
because,  as  N  is  a  matching,  their  first  and  last  node  must  be  free  under  the  matching  M.  As  every 
such  path  increases  the  size  of  the  matching  by  one  according  to  the  previous  Lemma,  there  must  be  at 
least  \N\  -  |M|  such  components  and  hence  augmenting  paths.  | 

Note  here  that  Berge’s  Theorem  is  an  immediate  corollary  to  this  Lemma. 

Lemma: 

Let  M  be  a  matching  of  size  r,  and  suppose  that  the  cardinality  of  a  maximum  matching  N  is 
a  >  r.  Then  there  exists  an  augmenting  path  with  respect  to  M  of  length 


Proof:  By  the  previous  Lemma  N  ®  M  contains  s  -  r  vertex-disjoint  (and  hence  edge-disjoint)  paths. 
One  of  them  must  therefore  contain  at  most  [r/(s  —  r)J  edges  of  M,  and  hence  has  a  total  length 
bounded  by  2|r/(»  -  r)j  +  1.  | 
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Lemma: 


Let  P  be  a  shortest  augmenting  path  with  respect  to  M,  and  P'  an  augmenting  path  with  respect  to 
M  <g  P.  Then 


!P'|  >  |p|  +  |PnP'|. 


Proof:  Let  TV  :=  M  (g)  P  g)  P*.  As  therefore  \N\  =  \M\  +  2,  by  the  previous  Lemma  the  difference 
contains  at  least  two  vertex-disjoint  (and  hence  edge-disjoint)  augmenting  paths,  say  P|  and  Pt. 
Now  M  0  N  =  P  ®  P\  and  hence 

|P®  P'1  >  |P*l+|Pt|  >  2|P| 

because  P  was  chosen  as  a  shortest  augmenting  path. 

As  by  definition  |P  g)  P'|  =  |P|  +  |P'|  -  2|P  n  P'1,  we  obtain 

|P'|  >  \P\  +  2jP  n  P'|  >  |P|  +  |P  n  P'1, 

which  we  wanted  to  prove.  | 

The  combination  of  the  previous  lemmas  now  suggests  the  following  scheme  for  an  algorithm  to  find 
a  maximum  matching: 

start  with  matching  M0  =  0; 

compute  a  sequence  M0,  Po,  MltPi,...,  Mi,  Pi,...  where  P,  is  a  shortest  augmenting  path  with 
respect  to  M<,  and  Af.+i  =  M,  (g  P,. 

Of  course  we  have,  by  the  earlier  lemmas,  that  |P,|  <  IPi+i),  but  also 
Lemma: 

For  all  t  and  j  such  that  |Pj|  =  jP^,  the  paths  P,  and  P,  are  vertex-disjoint. 

Proof:  Assume  for  contradiction  that  i  and  j  are  a  closest  pair  of  indices  such  that  t  <  j,  |P;|  =  |P,|, 
and  Pi  and  P>  arc  not  vertex-disjoint  As  i  and  j  are  closest,  all  Pk  with  i  <  k  <  j  are  vertex-disjoint 
from  both  Pi  and  P,  .  Because  of  this,  P,  also  is  an  augmenting  path  with  respect  to  Mi+l  =  M,  (g)  P, 
(because  none  of  the  P*  with  *  <  k  <  j  touches  any  of  the  vertices  in  P,).  which  means  that  |P,|  > 
|P<|  +  |Pi  O  Pj|,  and  hence,  as  |P<|  =  |P,|.  that  P<  n  P,  =  ®  and  P,  and  P,  are  edge-disjoint.  But  if  P< 
and  Pj  had  a  vertex  v  in  common  they  would  also  have  in  common  the  matched  edge  of  Mi+l  incident 
on  v  (note  that  every  node  of  P,  is  incident  to  a  matched  edge  in  Mi+1!)  as  all  Pk  with  *  <  k  <  j  did 
not  affect  v.  Hence  Pi  and  P,  are  in  fact  vertex-disjoint,  contradicting  our  assumption.  | 


Theorem: 


Let  a  be  the  cardinality  of  a  maximum  matching.  Then  the  sequence 

InUPrl,... 

contains  at  most  2[a‘^J  +  1  distinct  numbers. 


Proof:  Let  r  =  [a  -  yf» J.  As  by  construction  |Mr|  =  r,  we  obtain  by  our  estimate  of  the  length  of  a 
shortest  augmenting  path  that 


\Pr\  <  2 


l"  ~  y/»J 

[  a  —  [a  —  v'aj 


+  1  <  2 

8 

J 

Lfv^lJ 

•f  1  <  +  L 
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Hcncc,  if  i  <  r,  then  |P,-|  is  one  of  the  odd  numbers  in  {1,3, 5, . ..  ,2[v/sJ  +  1}  which  has  cardinality 

v/s  +  1,  and  |P,+i| . |P»-i|  contribute  at  most  another  s  -  r  -  1  <  [y^sj  distinct  numbers,  yielding 

a  total  bounded  by  2[v/aJ  +  1.  | 

Hence  we  get  the  following 
Refined  scheme  for  a  maximum  matching  algorithm: 

M  :=  0; 

while  there  is  an  augmenting  path  with  respect  to  M  do 
begin 

l  :=  lengji  of  a  shortest  augmenting  path  with  respect  to  M\ 

find  a  maximal  (with  respect  to  set  inclusion)  set  {Qx, . . . ,  Q,}  of  augmenting  paths  with  respect  to 
M,  all  of  length  L  and  all  mutually  vertex-disjoint; 

M:=  ®  Qr 

end. 

Corollary: 

The  loop  of  the  above  algorithm  is  executed  at  most  0(|V'|I/2)  times. 

Proof:  This  is  an  immediate  consequence  of  the  fact  that  the  size  of  a  maximum  matching  is  bounded  by 

0(\V\).  I 

We  also  refer  the  reader  to  [HoK.73]. 


8.2.  Maximum  Matchings  in  Bipartite  Graphs. 

We  mw  implement  the  above  maximum  cardinality  matching  algorithm  for  bipartite  graphs  G  — 
(Vi,Vt,E)  (i.e.,  the  node  set  of  G  is  partitioned  into  two  sets  V1  and  V2,  and  all  (undirected)  edges 
arc  between  a  nodt  in  Vx  and  one  in  V2).  Mote  that  an  augmenting  graph  in  a  bipartite  graph  always 
connects  a  free  node  in  V,  to  a  free  nod?  in  V2. 

W  first  start  a  simultaneous  BFS  at  all  free  nodes  in  V,  which  at  every  odd  step  proceeds  along  all 
unmatched  edges  which  it  has  reached  at  its  foremost  level,  and  which  at  every  even  step  proceeds  from 
a  node  by  the  (unique,  if  present  at  all)  matched  edge  incident  to  that  node.  This  simultaneous  BFS 
detects  the  length  of  a  shortest  augmenting  path  when  it  reaches  a  free  node  in  V*,  and  at  the  same  time 
determines  a  subgraph  of  G  which  contains  all  shortest  augmenting  paths  (with  respect  to  the  current 
matching  M ): 

for  all  v  €  Vt  (J  V2  do  label[v]  :=  0; 
rt:=0;  l  :=  1 

co  at  stage  l  of  the  simultaneous  BFS.  the  nodes  in  R  (resp..  L  below)  arc  those  nodes  in  V2  (resp., 
V,)  visited  for  the  first  time  by  the  BFS,  and  hcncc  at  a  distance  (via  an  alternating  path)  of  l  from  a 
free  node  in  Vi  oc; 
for  all  free  nodes  v  €  Vj  do 
for  all  v — w  6  E  do 
begin 

label[w]  :=  1;  R  :=  /ZUi1®} 
end; 

while  R  7^  0  and  R  contains  no  free  node  do 
begin 
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for  all  w  £  R,  v — to  G  M  do 
if  labcl\v ]  =  0  then  begin  L  :=  //|J{v};  label[v]  :=  l  end 

co  note  chat  in  this  step  we  can  never  reach  free  nodes  in  V,  because  we  are  using  only  matched 
edges  from  V*;  hence  if  loi»e/[v]  =  0,  then  v  has  not  been  visited  by  the  search  before  oc; 

R  :=  9;  l  :=  1  +  1; 
for  all  i>  G  L,  v — w  G  E  -  M  do 
if  /abe/[w]  =  0  then  begin  R  :=  label[w\  l  end 

end; 

R  :=  the  set  of  free  nodes  in  R\ 

If  at  the  end  of  this  routine  the  set  R  is  empty  then  obviously  no  augmenting  path  exists,  and  the 
matching  M  is  maximum.  Otherwise  the  subgraph  given  by  all  edges  v—w  with  /afcef[w]  =  label[v)  + 1 
contains  all  shortest  augmenting  paths  as  follows  immediately  from  the  properties  of  BFS.  Furthermore, 
as  G  is  supposed  to  be  connected  the  above  segment  of  the  algorithm  takes  time  0{\E\). 

We  now  determine,  by  a  repeated  DFS  in  the  subgraph  determined  above,  a  maximal  set  of  vertex- 
disjoint  shortest  augmenting  paths.  Note  that  their  length  is  l.  The  DFS  uses  a  stack  with  the  obvious 
operations  on  it 


stack  :=  0; 

for  all  free  nodes  v  G  V\  do 
begin 

co  we  try  to  construct  a  shortest  augmenting  path  suiting  from  v  and  vertex-disjoint  from  all  such 

paths  constructed  before  oc 

push(v); 

while  stack  ^  0  do 
begin 

w  :=  top  (stack)', 

if  there  is  an  edge  w — w'  with  labcl\w’\  =  label\w]  +  1  then 
begin 
push(tu'); 
if  w'  G  R  then 
begin 

co  a  new  shortest  augmenting  path  has  been  found  oc 
print(8tacfc);  stack  :=  0 
end; 

label\w'\  =  0  co  this  marks  w'  as  visited  by  the  DFS  oc 

end 

else  pop 
end 
end; 

Obviously  this  algorithm  produces  a  maximal  set  of  vertex-disjoint  augmenting  paths  of  length  l,  and 
the  time  it  requires  is  again  0(|2J|).  Together  we  have  achieved  an  implementation  which  requires  for 
every  phase  described  in  the  above  scheme  time  linear  in  the  number  of  edges  of  G.  With  the  bound  on 
the  number  of  such  phases  we  hence  obtain 

Theorem; 

There  is  an  algorithm  for  the  maximum  cardinality  matching  problem  in  bipartite  graphs  which 
requires  time  bounded  by  0(|F|l/a|2J|). 
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8.3.  Maximum  Cardinality  Matching  in  General  Graphs. 

We  use  the  same  general  approach  as  pointed  out  in  section  8.1.  for  the  maximum  cardinality 
matching  problem  in  general  graphs,  i.e.,  we  try  to  design  an  algorithm  consisting  of  phases  where  in 
each  phase  a  maximal  set  of  vertex-disjoint  shortest  augmenting  paths  is  used  to  increase  the  size  of  the 
matching. 

We  are  here  only  going  to  outline  the  basic  ideas  of  such  an  implementation,  and  refer  to  [MiV80] 
for  more  details. 

The  algorithm  performs  a  simultaneous  BFS  (suiting  at  all  nodes  free  under  the  current  matching) 
which,  as  in  the  bipartite  case,  alternates  between  unmatched  and  matched  edges.  Each  time  before  a 
new  level  is  added  to  the  BFS,  the  algorithm  checks  for  bridges.  A  bridge  is  an  edge  such  that  both 
its  endpoints  have  been  reached  by  the  BFS,  and  both  in  steps  involving  the  same  kind  (matched  or 
unmatched)  of  edges.  Also,  if  they  were  reached  by  matched  edges,  the  bridge  must  be  an  unmatched 
edge,  and  vice  versa. 

For  every  bridge,  the  algorithm  then  checks  by  backtracing  whether  there  are,  from  the  two  endpoints 
of  the  bridge,  two  vertex-disjoint  alternating  paths  to  free  nodes.  If  such  paths  arc  found  then  in  fact  a 
shortest  augmenting  path  has  been  found,  and  it  is  erased  from  the  graph,  together  with  all  edges  incident 
to  nodes  on  the  path  in  order  to  avoid  that  any  of  its  parts  is  visited  by  the  BFS  later  on.  The  backtracing 
is  done  by  running  two  DFS’s  (one  from  every  endnodc  of  the  bridge)  in  parallel  and  in  a  way  such  that 
the  nodes  furthest  down  in  the  DFS  trees  are  at  levels  in  the  BFS  trees  differing  by  at  most  one. 

If  the  two  DFS's  fail  to  discover  an  augmenting  path  they  actually  discover  a  blossom ,  that  is  an  odd 
length  cycle  with  a  maximal  number  of  matched  edges  in  it  which  became  closed  by  the  bridge.  Such 
a  blossom  is  shrunk  to  a  pseudo-node  by  introducing  path  compression  pointers  from  all  its  nodes  to  its 
root  which  is  the  unique  node  in  the  cycle  not  incident  to  a  matched  edge  also  in  the  cycle.  In  this  way, 
edges  in  the  blossom  need  not  be  traversed  any  more  in  later  searches  for  augmenting  paihs.  Blossoms 
can  occur  one  within  another.  A  blossom  is  reexpanded  if  an  augmenting  path  is  detected  which  conuins 
the  pseudo-node  belonging  to  it. 

A  careful  implementation  of  these  concepts  guarantees  that  in  every  phase  each  edge  of  G  has  to 
be  visited  only  a  constant  number  of  times,  and  also  that  the  overhead  for  all  operations  is  bounded  by 
0(| IS\).  For  details  sec  [MiV80].  We  obtain  the  following  result: 

Theorem: 

There  is  an  0(|V|,',|£|)  algorithm  for  the  maximum  cardinality  matching  problem  in  general  graphs. 


8.4.  Maximum  Weight  Matching  Problems. 

Another  generalization  of  matching  problems  considers  (undirected  and  connected)  graphs  G  = 
(V,E)  where  every  edge  is  assigned  a  weight  c(e)  >  0.  The  weight  c[M)  of  a  matching  M  C  E  is 
then  defined  to  be 

c{M)  :=  ^2  c(e). 

tgM 

A  matching  M  is  called  a  maximum  weight  matching  if  there  is  no  matching  M'  of  (G,  c)  which  has 
weight  c(M')  >  c (M). 

For  the  maximum  weight  matching  problem  there  arc  no  results  known  equivalent  to  those  presented 
in  section  8.1.  for  the  maximum  cardinality  problem.  Instead,  the  most  efficient  algorithms  known  for  this 
problem  use  the  dual  problem  that  arises  when  one  formulates  the  maximum  weight  matching  problem 
as  a  linear  optimization  problem,  lhis  dual  problem  serves  to  select  a  sequence  of  augmenting  paths  in 
order  to  iteratively  increment  the  weight  of  the  matching.  For  more  details  sec  [Law76].  We  summarize: 
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Theorem: 

(a)  There  is  an  algorithm  for  the  maximum  weight  matching  problem  of  time  complexity  0(|V j*). 

(b)  There  is  an  algorithm  for  the  maximum  weight  matching  problem  of  complexity  0(|l'||£|  log|V'|). 

For  the  first  of  these  algorithms,  we  refer  to  [Law76],  for  the  second  to  [GaM81]. 


Chapter  9 


Maximum  Flow  in  Networks 


9.1.  Flows  and  Cuts. 

A  network  is  a  digraph  G  =  (V ,E)  with  no  self-loops  and  parallel  edges  (anti-parallel  edges  are 
permitted),  and  with  two  distinguished  nodes,  the  source  s  and  the  sink  tj^s.  In  addilition,  every  edge 
e  £  E  is  assigned  a  nonnegative  capacity  c(e ).  A  flow  in  G  is  a  real  function  /  from  E  such  that 

(i)  for  every  edge  e  £  E  its  value  is  bounded  by  the  capacity  of  the  edge:  0  <  /(e)  <  c(e);  and 

(ii)  for  every  node  v  different  from  s  and  t,  Kirchhoff's  law  is  satisfied: 

£  I(e)~  £  f(e')  =  0. 

By  the  total  flow  F  =  F{f)  we  mean  the  value  of  the  above  sum  at  the  source  s,  i.e. 

£  /(e)-  £  f(e'). 

e—($,w)£E  e#=(iu,«)6£ 

It  should  be  clear  that  this  sum  is  the  negative  of  the  corresponding  sum  at  the  sink  t. 

A  flow  /  in  G  is  called  maximum  if  F(f)  is  maximum  for  all  flows  in  G  (with  capacity  c). 

Definition: 

A  cut  of  the  network  G  is  a  subset  S  of  V  such  that  s  6  S  and  t  S. 

For  a  cut  S,  let  J  denote  the  complement  V  —  S  of  S,  and  let  Es  ■§  be  the  subset  of  edges  in  E 
from  nodes  in  S  to  nodes  in  5  (E-g  s  is  defined  analogously). 

Lemma: 

For  every  cut  S  and  every  flow  /  we  have 

F(f)=  £  /(c)-  £  /(c). 

Proof:  Note  that  if  wc  sum 

F.:=  £  /(c)-  £  f(e') 

e3s(w,u>)€£  e'  =  (u>,v)6£ 
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over  all  v  €  S  we  obtain  F,  which  equals  F[f)  as  all  the  other  Fv  are  zero  by  condition  (ii)  for  Rows. 
However,  if  we  rearrange  the  terms  in  the  sum  all  those  /(e)  where  both  endpoints  of  e  are  in  S  cancel 
because  they  appear  exactly  twice  and  with  opposite  sign,  and  we  are  left  with  the  right  hand  side  of  the 
equation  claimed  in  the  Lemma.  |  _ 

If  we  set  the  capacity  of  a  cut  5  to  be 

C(S):=  £  c(e), 

t£Bs,  T 

we  immediately  obtain  that 

F  <  C(S)  for  every  cut  S. 

This  upper  bound  can  in  fact  be  achieved  as  stated  in  the  following 
Theorem  (Ford,  Fulker  i  1956): 

max{F(/);  /  flow  in  G }  =  min{C(S);  S  cut  in  G) 

The  proof  of  this  Theorem  will  follow  from  the  next  lemma  (also  see  |FoF56]). 

Definition: 

A  sequence 

a  =  v0 — l>i — vt — . . .  — vr_i — tv 

of  edges  — v,  without  node  repetition  is  called  an  augmenting  path  to  vT  with  respect  to  some  flow 
/  if  for  all  t,  t  =  1, . . . ,  r,  either 

a)  e  =  (tv-i,  Vi)  €  E  and  c'(e)  :=  c(e)  -  /(e)  >  0,  or 

b)  e  =  (tv,  v,_i)  €  E  and  c'(e)  :=  /(e)  >  0. 

An  augmenting  path  (with  respect  to  /)  is  an  augmenting  path  to  the  sink  t. 

It  is  clear  that  if  there  is  an  augmenting  path  P  with  respect  to  a  flow  /  then  /  can  be  increased 
along  this  path  by  S  =  min{c'(c);  e  edge  in  P}  in  the  following  way:  If  an  edge  e  in  P  is  directed  from 

8  to  t  its  flow  value  is  increased  by  6,  and  if  e  is  directed  the  other  way,  it  is  decreased  by  the  same 

amount.  This  change  clearly  maintains  KirchhofF’s  law  for  all  nodes  different  from  the  source  and  sink, 
and  it  increases  the  total  flow  by  6. 

Lemma: 

A  flow  is  maximum  iff  it  has  no  augmenting  path. 

Proof:  It  only  remains  to  show  the  sufficiency  of  the  condition  in  the  Lemma.  Hence  suppose  that  a 
flow  /  has  no  augmenting  path,  and  let  S  be  the  set  of  nodes  v  reachable  from  the  source  s  by  an 
augmenting  path  to  v.  Then  s  6  S  as  the  empty  path  is  an  augmenting  path  (to  s),  and  t#S  as  there 
is  no  augmenting  path  in  G  with  respect  to  /.  The  definition  of  S  also  implies  that  for  every  edge 
eg  £sj,  the  flow  /(c)  equals  the  capacity  c(e)  (because  otherwise  the  endpoint  of  e  would  also  be  a 
member  of  S’),  and,  for  the  same  reason,  that  for  every  edge  e  £  E-§  s,  the  flow  /(e)  is  zero.  Therefore 
we  obtain,  by  the  previous  Lemma,  that  in  fact  F(f)  =  C(S)  from  which  we  can  conclude  that  /  is 
maximum  as,  by  the  above  Corollary,  F{f)  <  C(S)  for  every  flow  /'.  | 

Note  that  as  a  corollary  we  have  also  obtained  a  proof  of  the  Min-Cut-Max-Flow  Theorem  stated 
earlier. 

The  results  presented  so  far  suggest  to  constaict  a  maximum  flow  by  starting  with  some  flow  satisfying 
conditions  (i)  and  (ii),  and  then  trying  to  successively  increase  it  along  augmenting  paths  until  such  a  path 
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nc  longer  exists  in  which  ease  the  resulting  flow  is  guaranteed  to  be  maximum.  This  idea  also  underlies 
the  initial  method  by  Ford  and  Fulkerson  to  construct  a  maximum  flow.  But  care  has  to  taken  in  selecting 
the  augmenting  paths  (Zah73).  As  a  matter  of  fact,  Ford  and  Fulkerson  pointed  out  an  example  (with 
irrational  edge  capacities  c(e))  for  which  the  simple  method  (which  consistently  makes  “bad”  choices)  fails 
to  terminate  and  even  to  converge  towards  the  actual  maximum  flow.  But  also  if  the  edge  capacities  are 
all  positive  integers  the  edge  selection  is  critical  for  the  efficiency  of  the  algorithm  as  demonstrated  by  the 
following  example.  Let  M  be  a  (very)  big  positive  integer  such  that  the  size  of  its  binary  representation 
dominates  the  size  of  the  whole  network. 


a 


If  we  start  in  this  example  with  a  zero  flow  and  select  as  augmenting  paths  altematingly  s — a — b — t 
and  s — b — a — t  it  takes  the  algorithm  0(M)  augmentations  to  reach  the  maximum  flow,  and  hencu  a 
running  time  exponential  in  the  size  of  the  input. 


9.2.  The  Dinits  Algorithm. 

We  now  show  how  to  overcome  the  difficulty  indicated  above  by  a  method  using  so-called  pre-flows. 
Call  an  edge  v — w  in  the  network  (C,  c)  augmentable  from  v  to  w  if 

(a)  e  =  (v,  w )  €  B  and  /(e)  <  c(e),  or  if 

(b)  e  =  {w,  v)  6  E  and  /(e)  >  0. 

The  following  algorithm  divides  the  network  (C,  c)  which  has  some  flow  /  defined  on  it,  into  disjoint 
layers  V0, . . . ,  V,  C  V  using  a  BFS: 

Vo  :=  {a};  i  :=  0; 

repeat 

T  :=  {it,  there  is  an  edge  w — v  augmentable  from  w  to  v  with  w  6  V,  and  v  V)}; 

i  :=  t  +  1;  V,  :=  T 
until  T  =  0  or  t  G  T; 

if  T  —  0  then  stop  “flow  maximum”  else  V,  :=  {t}; 

The  layered  network  G'(C,c,  /)  now  consists  of  all  the  nodes  in  the  V}  (the  final  value  of t  is  called 
the  length  of  the  network  and  denoted  by  l)  and  the  edges  in  any  of  the  fij,  j  =  l,...,  I  with 

Ej  :=  {u  -+  w\  v  €  Vj-i,  vt  €  Vj,  and  v — w  €  E  augmentable  from  v  to  w}. 

Note  that  the  direction  of  the  edges  in  the  ZJ,  may  be  different  from  the  original  direction  in  E.  In 
addition  every  edge  e  =  (v,  w)  in  the  Ej  is  assigned  the  capacity  c'(e)  which  is  the  sum  of  the  values 
c(e')  —  /(c')  respectively  f(e')  over  all  edges  e'  between  v  and  w  which  arc  augmentable  from  v  to  w. 

Once  we  have  a  nonzero  flow  /'  in  the  layered  network  G'{G,c,f)  it  is  clear  how  to  augment  the 
flow  /  in  (C,c):  If  the  flow  along  an  edge  in  the  layered  network  is  in  the  same  direction  as  in  G  it  is 
added  to  /(e),  and  otherwise  subtracted  (a  little  bit  of  care  has  to  be  exercised  if  there  are  two  antiparallcl 
edges  between  some  nodes  v  and  w  in  (C,  c,  /)  which  arc  both  augmentable  from  v  to  to  and  have  been 
replaced  by  one  edge  in  the  layered  network). 


8.2.  Thf.  Dinits  Algorithm. 
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Definition: 

A  flow  f  in  a  layered  network  is  maximal  if  every  path  from  s  to  t  in  the  layered  network  contains 
an  edge  e  with  f'(e)  =  c'(e). 

We  now  propose  to  construct  a  maximum  flow  in  a  network  (G,c)  in  phases  which  consist  each  of 
the  following  steps: 

start  with  flow  /  (initially  the  zero  flow); 

construct  the  layered  network  C'(G,  c,  f)  together  with  the  capacity  c'\ 
find  a  maximal  flow  /'  in  the  layered  network; 
augment  /  by 

As  the  layered  network  G'(G,  c,  f)  contains  all  shortest  augmenting  paths  with  respect  to  /  (some  edges 
may  be  reversed  with  respect  to  G)  we  obtain  a  similar  situation  as  for  the  maximum  capacity  matching 
problem. 

Lemma: 

The  layered  network  of  the  fc-th  phase  has  length  at  least  fc. 

Proof:  The  proof  is  by  induction  on  k.  For  k  —  1  it  follows  from  the  condition  that  the  sink  t  is  different 
from  the  source  s.  For  the  inductive  step  we  note  that,  in  the  fc-th  phase,  there  can  be  no  augmenting 
paths  of  length  <  k  —  1 ,  and  also  that  the  maximal  flow  constructed  in  the  k  —  1  -st  phase  saturates  at 
least  one  edge  on  each  augmenting  path  of  length  k  —  1.  As  all  these  paths  appeared  in  the  k  —  1-st 
phase,  no  augmenting  path  of  length  <  k  is  left  at  the  beginning  of  the  fc-th  phase.  But  this  implies  that 
the  fc-th  layered  network  has  length  at  least  fc,  and  completes  the  induction.  | 

Corollary:  The  number  of  phases  (as  given  in  the  above  schematic  algorithm)  is  bounded  by  |K|. 

In  the  following,  we  shall  be  concerned  with  exhibiting  various  efficient  algorithms  to  construct 
maximal  flows  in  layered  networks.  Such  flows  are  also  called  preflows. 

Dinils'  Method: 

Die  first  solution  is  due  to  Dinits  [Din70],  It  constructs  a  maximal  flow  by  an  iterated  DFS  from  the 
source  to  the  sink.  Each  successful  search  determines  a  new  augmenting  path,  the  value  of  the  additional 
flow  is  given  by  the  minimal  capacity  of  an  edge  on  the  path. 

construct  G'{G,c,f)  and  c'; 

for  all  edges  e  in  G'  do  f'\e)  :=  0; 

while  there  remains  an  edge  from  the  source  s  do 

begin 

construct  by  a  DFS  a  path  P  from  s  to  t  and  delete  during  this  DFS  all  dead-end  edges 

co  an  edge  is  a  dead-end  edge  if  the  DFS  cannot  proceed  from  its  endpoint  (y^  t)  because  it  has  no 

successors  oc; 

A  :=  min{c'je];  e  edge  in  P}\ 

for  all  c  £  P  do 

begin 

•  f'\e]  ■■=  /'M  +  a; 

if  c'[c|  =  A  then  delete  e 

end 

end. 
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Clearly  the  algorithm  produces  a  maximal  flow  because  it  saturates  at  least  one  edge  on  every  path 
from  the  source  to  the  sink.  The  time  requirement  per  phase  is  Ofll7 1|£|)  because  between  any  two  edge 
deletions  there  are  at  most  l  steps  where  /  is  the  length  of  the  layered  network,  and  of  course  l  <  |Vj. 
Hence  we  conclude 

Theorem: 

Dinks’  Algorithm  constructs  a  maximum  flow  in  an  arbitrary  network  in  time  0(|V|2|E|). 


9.3.  The  Malhotra-Pramodh  Kumar-Maheshwari  Algorithm. 

More  efficient  algorithms  for  the  maximum  flow  problem  are  also  based  on  the  above  subdivision 
into  phases  and  the  construction  of  maximal  preflows.  But  they  don't  use  augmenting  paths.  The  first 
such  algorithm  is  due  to  Karzanov  [Kar74].  We  shall  here,  however,  present  another  algorithm  of  the 
same  asymptotic  complexity  which  is  easier  to  describe  (MPM78). 

Given  some  preflow  f  in  the  layered  network  G',  let  us  define  the  potential  P[v)  of  some  node  v  to 
be  the  minimum  of  the  sum  of  all  remaining  capacities  c'(e)  -  /'(e)  taken  over  all  edges  e  leaving  v,  and 
the  same  some  over  all  edges  entering  into  v.  Clearly,  P(v)  is  tire  maximum  amount  by  which  the  flow 
through  v  can  be  increased,  if  we  look  just  at  v.  MPM  algorithm  can  now  be  described  as  follows: 

while  P(s)  and  P(f)  arc  5^  0  do 
begin 

let  v  be  some  node  with  minimal  P(v); 

starting  from  v ,  push  additional  flow  of  P(v)  units  towards  t.  update  the  flow  function  accordingly, 
and  delete  edges  which  become  saturated; 

in  the  same  way,  starting  from  t>,  pull  additional  flow  of  P(v]  units  from  «; 
update  P; 

while  there  is  v  5^  s,  t  with  P(v)  =  0  do 
begin 

remove  v  and  its  incident  edges;  . 
update  P 
end 
end. 

The  pushing  and  pulling  of  additional  flow  is  done  in  a  way  such  that  at  every  node  that  gets  touched 
the  edges  arc  scanned  in  the  order  in  which  they  appear  on  the  adjacency  list,  and  as  much  additional 
flow  is  added  as  possible,  as  long  as  the  supply  lasts.  In  this  way,  at  most  one  outgoing  (resp.,  incoming) 
edge  receives  nonzero  additional  flow  without  becoming  saturated  at  the  same  time.  Hence  the  time 
bound  for  one  pass  of  the  outer  loop  is 

0{\V\  +  the  number  of  edges  getting  saturated  (and  hence  deleted)). 

As  in  every  such  pass  a  node  gets  removed,  we  obtain  a  time  bound  of  0(\E\  +  |V|2)  per  phase, 
and  hence  the 

Theorem; 

The  MPM  maximum  flow  algorithm  has  time  complexity  0(|K|3). 

As  we  remarked  already  earlier,  the  Karzanov  algorithm  [Kar74]  has  the  same  asymptotic  complexity. 
A  somewhat  more  efficient  algorithm  for  the  maximum  flow  problem  which  has  time  complexity 
0(|  V |*j/?|l/*)  is  described  in  (Chc77).  Other  efficient  solutions  can  be  found  in  (GaN79],  |Shi78J  (both 
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of  complexity  0(|V||i?|logs|V,|)),  and  in  |Slc80]  (complexity  0(|V||i?|  log| V |)).  These  latter  solutions 
make  use  of  more  elaborate  data  structures  to  find  maximal  preflows. 


9.4.  Extensions  and  Restrictions. 

We’d  like  to  shortly  mention  some  variations  of  the  basic  maximum  flow  problem.  For  a  more 
detailed  discussion  we  refer  to  [Eve79]  and  [Law76]. 

(a)  In  addition  to  the  capacity  (unction  c  we  may  have  another  function  6  on  the  edges  representing  a 
lower  bound  for  the  flow  /  through  each  edge.  In  this  case  we  would  demand  6(e)  <  /(e)  <  c(e) 
for  every  edge  e,  and  of  course  Kirchhoff’s  law  as  before  for  every  node  other  than  the  source  and 
sink.  Contrary  to  the  situation  of  the  basic  maximum  flow  problem  where  the  zero  flow  was  a  legal 
flow,  a  legal  flow  might  not  exist  at  all  in  a  network  with  arbitrary  lower  bounds.  However,  we  can 
reduce  the  test  for  the  existence  of  (and  the  construction  of)  a  legal  flow  to  another  maximum  flow 
problem  with  zero  lower  bounds  and  size  proportional  to  the  size  of  the  original  problem.  Once  we 
have  established  a  legal  flow  wc  may  use  any  of  the  method1',  discussed  above  (with  the  appropriate 
modifications  in  the  definition  of  the  capacity  c'  of  the  edges  in  the  layered  network)  to  construct  a 
maximum  flow. 

(b)  Instead  of  looking  for  a  maximum  flow,  wc  might  be  interested  in  a  minimum  flow.  But  clearly,  as 
F,  =  —Ft,  a  minimum  flow  from  s  to  f  is  a  maximum  flow  from  t  to  8. 

(c)  Often  the  capacities  of  the  edges  in  a  network  are  either  zero  or  one.  Such  a  network  (and  we  even 
allow  here  parallel  edges)  is  called  a  0-1  -network.  Note  that  in  a  layered  network  coming  from  a 
0-1-network,  every  edge  on  an  augmenting  path  becomes  saturated.  Also  note  that  a  layered  network 
which  allows  a  big  preflow,  intuitively  has  to  be  wide  because  every  edge  accomodates  at  most  one 
unit  of  flow.  Using  these  properties,  one  can  derive  the  following 

Theorem: 

For  0-1-networks,  the  Dinits  algorithm  has  time  complexity  0(|£7|s/*). 

(d)  A  special  case  of  0-1-networks  are  those  with  no  parallel  edges  (also  called  0-1-networks  of  type  1). 
For  them,  we  obtain 

Theorem: 

For  0-1-networks  of  type  1,  the  Dinits  algorithm  has  time  complexity  0( |V|i/*). 

(e)  An  even  more  restricted  class  of  0-1-networks  are  those  where  every  node  has  either  at  most  one 
outgoing  or  at  most  one  entering  edge.  These  networks  are  called  0-1-networks  of  type  2.  It  is 
possible  for  them  to  derive  an  even  better  upper  bound: 

Theorem: 

For  0-1-nctworks  of  type  2,  the  Dinits  algorithm  has  time  complexity  0(jF|l/1|£|). 

The  basic  idea  for  the  proof  is  again  a  bound  on  the  length  of  the  corresponding  layered  networks. 


95.  Applications. 

Wc  finally  state  some  applications  of  the  algorithms  for  the  maximum  flow  problem  to  other  problems. 
Again  we  only  list  the  results  and  refer  for  the  details  to  [Evc79]  and  [Law76). 
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(a)  The  vertex-connectivity  of  a  directed  graph  is  the  minimal  number  of  nodes  that  has  to  be  removed 
from  it  such  that  there  remain  two  vertices  a  and  b  which  are  no  longer  connected  by  a  path  from  a 
to  6.  If  there  is  an  edge  from  every  vertex  to  every  other  vertex,  we  set  the  edge-connectivity  to  be 
|V|  -  1  by  default.  An  analogous  definition  is  made  for  undirected  graphs.  The  determination  of  the 
vertex-connectivity  of  (directed  or  undirected)  graphs  can  be  reduced  to  a  maximum  flow  problem, 
in  which  basically  every  node  of  the  graph  is  split  into  two  nodes  for  the  network  connected  by  an 
edge  with  unit  capacity.  The  algorithm  then  checks,  for  every  pair  (a,  6)  of  nodes  in  the  graph,  the 
flow  from  the  node  corresponding  to  a  to  that  corresponding  to  b  in  the  network.  As  the  network 
is  of  type  2,  this  can  be  done  in  time  0(|V|1/2|Z?j)  for  every  pair,  and  hence  in  a  total  time  of 
0{\V\bli\E\).  However,  a  slightly  better  bound  can  be  obtained  if  we  observe  that  not  all  pairs  of 
nodes  have  to  be  checked.  As  a  matter  of  fact  it  suffices  to  check  k  nodes  against  all  others  if  by 
then  we  have  detected  two  nodes  for  which  the  removal  of  <  k  nodes  is  enough  to  separate  them. 
We  then  get  the 

Theorem:  The  vertex-connectivity  of  a  (directed  or  undirected)  graph  can  be  determined  in  time 
0(|K|1/2|f?|2). 

(b)  If  we  are  only  interested  if  the  vertex-connectivity  is  at  least  k  for  some  given  k,  we  can  stop  each 
flow  problem  once  the  flow  reaches  k  units. 

Theorem:  It  can  be  determined  in  time  0(k3 \E\  +  fc|V'||£'|)  (that  is  in  time  0(A:|F||£,|)  if  k  = 
0{\V\'/2))  whether  a  graph  is  fc- vertex -connected. 

(c)  The  edge-connectivity  of  (directed  or  undirected)  graphs  is  defined  analogously  to  the  vertex- 
connectivity.  However,  the  reduction  to  a  maximum  flow  problem  is  here  completely  trivial.  Also, 
wc  have  to  solve  only  |V|  —  1  network  flow  problems.  A  further  possibility  is  to  run  these  network 
flow  problems  in  parallel,  one  augmenting  path  at  a  time,  until  one  of  the  flows  cannot  be  increased 
any  more.  Thus  we  obtain 

Theorem:  The  edge-connectivity  c  of  a  (directed  or  undirected)  graph  can  be  determined  in  time 

(i)  0(|V||25| minfliSl1^2, |V |,/3 }),  or 

(ii)  0(c|V  111*1). 

(d)  As  a  final  application,  we  mention  the  maximum  cardinality  matching  problem  in  bipartite  graphs. 
The  reduction  to  a  maximum  flow  problem  is  straightforward,  and  wc  obtain,  using  Dinits’  algorithm, 
the  same  bound  as  derived  in  section  8.2.,  namely  0(\V\x^\E\). 


Chapter  10 


Problems 


Problem  t: 

Give  some  examples  of  algorithms  which  you  would  not  consider  “combinatorial". 

Problem  2: 

Prove  the  following  formulae  for  the  function  T  defined  by 

T(n)  =a7’(-”  )  +  an',  if  n  =  m*  >  l, 
in 

T(l)=d 

(where  a  >  0,  e  >  0,  s  >  0,  and  m  an  integer  >  2): 

!0(n'),  if  a  <  m* 

0(n'  logm  n),  if  s  =  m* 

©(nioe„  *)(  if  s  >  m' 

(assume  that  7’(n)  =  0  for  n  not  a  power  of  m). 

Give  an  explicit  solution  to  the  above  recurrence  (still  assuming  n  =  mk). 

Problem  3: 

In  a  situation  analogous  to  the  one  underlying  the  previous  problem  but  where  n  is  not  necessarily 
a  power  of  m,  one  might  get  a  recurrence  like 

2’(n)  =s7'([  — ])  +  an',  if  n  >  1 
m 

T(\)~d. 

Usually  there  arc  two  approaches  to  this  problem: 

(1)  embed  a  problem  of  size  »  into  one  of  size  n'  where  n'  =  rnlo*"-n  is  the  next  larger  power  of  m; 
(2)  embed  a  problem  of  size  n  into  one  of  size  n'  where  n'  =  f^]m  is  the  next  larger  multiple  of  m. 

Compare  these  approaches  and  the  bounds  on  the  resulting  complexities! 

(Hint:  use  the  facts  that 
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PROBLEM  i: 

When  analyzing  the  Schonhagc-Strasscn  algoritlim  for  fast  integer  multiplication  we  arrive  at  a 
recurrence  relation  for  its  time  complexity  T  of  the  form 

T(n)  =27’(fn1/2])  +  clogn,  ifn>3 
T(  2)  =T(1)  =  d. 

Solve  this  recurrence  using  the  techniques  presented  in  class  (assume  in  light  of  Problem  3  that  n  is  of 
an  appropriate  form). 

PllOBLKM  5: 

In  this  problem  you  are  asked  to  consider  the  implementation  details  of  binomial  trees  and  queues. 

(a)  Describe  an  implementation  and  show  how  the  operation  of  merging  two  binomial  trees  would  be 
done. 

(b)  Show  how  to  delete  the  smallest  element  in  a  set  represented  by  a  binomial  queue  using  your 
implementation  in  (a).  Use  the  algorithm  of  merging  binomial  queues  mentioned  in  class. 

(Hint:  use  an  eldest  son,  next  brother  representation) 

PROBLEM  6: 

Now  we  are  interested  in  the  operation  of  deleting  some  arbitrary  element  that  we  have  a  pointer 
to.  Modify  your  implementation  of  binomial  queues  in  Problem  5  so  that  it  allows  this  and  the  other 
operations  to  be  done  efficiently.  Your  implementation  should  use  only  two  pointers  per  node.  Describe 
how  the  delete  arbitrary  operation  is  done. 

PROBLEM  7: 

Prove  the  Decomposition  Lemma  stated  in  class. 

PROBLEM  8: 

Analyse  both  time  and  space  complexity  of  an  algorithm  for  computing  22"  on  a  RAM.  Use  both 
the  unit  and  logarithmic  cost  criterion. 

PROBLEM  9: 

Describe  how  to  modify  the  Patcrson-Pippcngcr-Schonhagc  median  algorithm  presented  in  class  so 
that  you  obtain  an  algorithm  for  arbitrary  selection  (i.c.,  an  algorithm  for  finding  tire  »'-th  largest  in  an 
n-clcmcnt  set  for  any  i  with  1  <  1  <  n),  and  determine  its  complexity. 


Problem  10: 

Let  ((/,-<)  be  a  totally  ordered  universe,  and  w(a)  >  0  a  weight  for  every  »  G  (/.  Describe  an 
efficient  algorithm  based  on  the  Blum-Ployd-Pratt-Rivcst-Tarjan  selection  algorithm  presented  in  class, 
which,  for  any  given  finite  5  C  U  and  w  >  0,  determines  the  maximal  (with  respect  to  C)  subset  M 
of  smallest  (with  respect  to  -< )  elements  of  S  such  that 

Y,  w(»)  <  w, 

•€Af 

and  analyze  Its  complexity  (in  terms  of  number  of  comparisons). 
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Problem  11: 

Suppose  you  arc  given  k  numbers  t0  =  0  <  *(  <  . . .  <  **  <  t*+l  =  n  +  1  and  you  want  to 
select  simultaneously  the  it  smallest,  the  *2,  ...,»*  smallest  clement  of  the  n-clcmcnt  set  S.  Develop  a 
multiple  selection  algorithm  which  solves  this  problem  using  at  most 

k 

0(n  log  n  -  5Z(tj  +  i  -  tj)  log(tJ+1  -  »>)) 

7—0 


comparisons. 

Problem  12: 

Consider  the  problem  of  finding  the  median  of  n  elements  on  a  computer  with  limited  space. 
Specifically,  suppose  that  data  elements  are  much  bigger  than  other  things,  so  that  there  is  room  to  store 
[n/2]  +  l  elements  and  a  linear  amount  of  other  stuff  that  can  not  encode  data  elements.  Find  an 
algorithm  subject  to  these  constraints  that  finds  the  median  while  doing  only  linearly  many  comparisons, 
for  the  case  where  «  =  2*  —  1. 

(Hint:  Use  the  solution  to  the  previous  problem  with  i3  =  V  —  1  for  j  —  l,...,fc  —  2,  and 
ij+k-2  —  n  —  2J  +  1,  also  for  j  =  l,...,fc  —  2,  and  apply  it  to  the  first  fn/2"l  elements  read  into 
memory.  Then  iteratively  input  as  many  elements  as  there  is  space  left,  and  discard  those  which  cannot 
possibly  be  the  median.) 

Problem  13: 

Give  in  PASCAL-likc  notation  an  0(e  log  log  n)  time  implementation  of  the  minimum  spanning  tree 
algorithm  MST2  discussed  in  class.  You  may  assume  that  the  vertices  of  the  graph  arc  represented  by  the 
integers  The  input  data  consist  of  an  (unordcrcd)  list  of  triples  (v,  w,c(i>,te))  which  contains 

every  edge  exactly  once  ((v,w)  =  (w,v)). 

(Warning:  You  may  not  assume  that  arrays  are  initialized  in  a  special  way!) 

Problem  14: 

Develop  a  linear  time  minimum  spanning  tree  algorithm  for  planar  graphs!  (Planar  graphs  arc  those 
which  can  be  drawn  in  the  plane  without  edges  crossing  one  another.)  F.mploy  a  similar  setting  as  in  the 
first  phase  of  algorithm  MS  12  and  make  additional  use  of  the  following  facts: 

a)  A  planar  graph  with  n  nodes  has  at  most  3n  -  3  edges  (no  parallel  edges  are  allowed;  if  n  >  3  the 
number  of  edges  is  even  bounded  by  3n  —  6). 

b)  If  an  edge  in  a  planar  graph  is  shrunk  to  a  node  the  graph  remains  planar. 


Problem  15: 

Constnict  an  example  which  shows  that  your  approach  in  the  previous  problem  docs  not  work  for 
arbitrary  graphs  with  e  —  0(n)  edges. 


Problem  16: 

Prove  an  n(mlogn)  lower  bound  for  the  unweighted  UNION-FIND  algorithm  which  uses  path 
compression,  for  a  sequence  of  n  UNION’S  and  m  >  n  (intermixed)  KIND’S. 

(Hint:  Remember  the  two  different  ways  to  look  at  binomial  trees!) 
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PltOHl.KM  17: 

Give,  in  a  PASCAL-like  language,  an  implementation  of  the  UNION-FIND  operations  with  weighting 
heuristic  and  path  compression  which  uses  only  0(1)  auxiliary  space  (this  means  that  you  cannot  use  a 
stack  for  intermediate  storage  of  the  FIND  path;  try  instead  to  store  diis  FIND  path  in  situ).  Assume 
that  each  element  is  represented  as  an  element  of 
type  pair=  record  elr.  integer;  next:  tpair  end; 

and  that  die  pointer  of  the  pair  for  the  root  of  a  tree  points  to  a  pair  containing  the  size  of  the  tree  and 
a  nil-pointer. 

PltOHLKM  18: 

Let  (G,  c)  be  a  connected,  undirected  graph  with  nonnegative  edge  weight  c.  Let  C  be  a  cycle  in  G, 
and  e  an  edge  in  C  such  that  c(e)  is  maximal  among  all  edges  in  C. 

a)  Prove  that  there  is  a  minimum  spanning  tree  for  ( G ,  c )  which  does  not  contain  e. 

b)  Prove  that  if  c(e)  is  strictly  larger  dian  c(e')  for  all  other  edges  e'  in  C,  no  minimum  spanning  tree 
for  (G,  c)  contains  e. 

PltOHl.KM  19: 

Let  G  be  a  connected,  undirected  graph  with  n  nodes  and  e  edges,  and  let  T  be  a  (rooted)  spanning 
tree  for  G.  Design  an  algorithm  of  time  complexity  0(e(n))  which,  for  every  edge  (v,  w}  in  G  and  not 
in  7’,  determines  the  lowest  common  ancestor  of  v  and  w  in  T  (the  lowest  common  ancestor  of  v  and  w 
is  die  node  of  T  where  the  unique  paths  from  the  root  of  T  to  v  respectively  w  part). 

(Hint:  In  a  post-order  search  of  T  you  hit  exactly  twice  a  node  incident  to  a  non-tree  edge  of  G. 
The  second  time,  the  lowest  common  ancestor  is  on  top  of  the  nodes  you  have  visited  so  far.) 

PltOHl.KM  20: 

Let  (G,  c)  be  a  connected,  undirected  graph  with  nonnegativc  edge  weight  c,  and  let  T  be  a  spanning 
tree  for  G.  Using  the  results  of  the  two  previous  problems,  describe  an  0(c(n))  algorithm  which  checks 
whcdicr  T  is  a  minimum  spanning  tree  for  (C,  c). 

PltOHl.KM  21: 

Describe  how  to  modify  die  algorithm  reg-pat  presented  in  class  such  as  to  find  for  every  position  j 
in  text 

a)  the  minimal  i  such  that  tezt< j  €  La; 

b)  the  maximal  i  such  that  textiti  €  Ln. 

(Hint;  Associate  in  the  simulation  of  the  pattern  automaton  Ma  an  appropriate  count  with  every 
state.) 

PltOHl.KM  22: 

Suppose  a  =  aio2...ar  is  a  string.  A  substring  of  a  is  a  string  c  of  the  form  c  =  a.-.Oj,.  ..a,„, 
where  0  <  it  <  is  <  ...  <  tn  <  r  + I.  That  is,  the  letters  of c  occur  in  a  in  the  same  order  but  not 
necessarily  consecutively.  Find  an  0()a)(6()  algorithm  that  finds  the  longest  string  that  is  a  substring  of 
both  a  and  6. 

PltOHl.KM  23: 

(a)  Find  a  linear,  off-line  algorithm  that  finds  the  least  common  ancestor  (LCA)  of  nodes  of  a  complete 

binary  tree.  'Flic  least  common  ancestor  of  two  nodes  is  the  node  lhat  is  an  ancestor  of  both  nodes 
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and  is  a  dcsccndcnt  of  all  such  nodes.  All  nodes  in  a  complete  binary  tree  have  zero  or  two  children. 
All  of  the  leaves  of  a  complete  binary  tree  arc  at  two  adjacent  levels.  For  an  off-line  algorithm,  the 
tree  and  all  queries  arc  available  before  any  query  must  be  answered.  Ilic  algorithm  must  answer  q 
queries  about  an  n  node  tree  in  0(n  +  q)  time.  (Hint:  Ii  is  possible  to  use  radix  sort  to  sort  a  nodes 
in  0(n  4-  a)  time.) 

(b)  Kxtend  your  algorithm  in  (a)  to  work  for  balanced  trees.  A  balanced  n-nodc  tree  has  depth  O(logn). 
(Hints:  The  algorithm  should  still  be  linear,  but  the  constant  will  depend  on  the  constant  in  the 
depth.  Conceptually,  extend  the  tree  to  make  it  complete.) 

(c)  |Kxtra  crcdit|  Consider  the  set  of  all  union  trees.  'Iliat  is  trees  that  arc  created  by  the  UNION 
algorithm  with  the  weighting  heuristic  and  without  path  compression.  Prove  that  any  such  tree  can 
be  made  into  a  balanced  binary  tree  in  linear  time. 

(d)  You  arc  to  show  that  the  LCA  problem  for  general  trees  can  be  reduced  to  that  for  union  trees. 
Consider  a  general  tree  T.  It  has  a  corresponding  union  tree,  U.  The  nodes  of  U  correspond  to  the 
leaves  of  T.  Consider  an  internal  node,  v  of  7\  Suppose  that  all  of  the  sons  of  v  correspond  to  sets 
in  U .  Then  modify  U  so  that  there  is  a  set  that  is  the  union  of  all  of  the  sets  that  correspond  to 
sons  of  v.  Make  v  correspond  to  die  node  representing  this  set. 

(i)  Show  the  sets  corresponding  to  the  sons  of  v  are  distinct.  Therefore  the  procedure  above  is  well 
defined. 

(ii)  Show  that  U  can  be  computed  from  T  in  linear  time. 

(iii)  Suppose  that  a  and  b  arc  leaves  of  T  that  correspond  to  A  and  B  in  U.  Let  C  be  the  least 
common  ancestor  of  A  and  B  in  U .  Show  that  U  and  T  can  be  prcproccsscd  in  linear  time  so  that 
the  least  common  ancestor  of  a  and  b  in  T  can  be  found  in  constant  lime,  given  C. 

PROBLEM  24: 

Prove  die  following  characterization  given  in  class: 

A  connected,  undirected  graph  is  biconncctcd  iff  every  pair  of  distinct  edges  lies  on  a  common  simple 
cycle. 


Problem  25: 

Prove  that  in  a  connected  undirected  graph  the  articulation  points  connect  the  blocks  in  form  of  a 

tree. 

Problem  26: 

Give  a  nonrccursivc  implementation  (PASCAl.-like)  of  the  DFS  routine  (for  directed  and  undirected, 
not  necessarily  connected  graphs)  using  a  LIFO  queue. 

Problem  27: 

Let  G  —  (V,Ii)  be  an  undirected  graph.  A  set  C  C  V  is  called  a  clique  iff  any  two  distinct 
v,w  €  C  are  connected  by  an  edge  in  Ii.  Let  C  be  a  clique,  and  F  a  PFS-forcst  of  G.  Prove  that 
all  w  €  C  lie  on  one  branch  of  a  tree  in  F.  Do  they  occur  contiguously  on  this  branch?  Justify  your 
answer. 

Problem  28: 

A  Eulerian  circuit  in  a  connected  undirected  graph  G  =  (V,  E)  is  a  circuit  which  contains  each  edge 
of  the  graph  exactly  once. 

(a)  Prove  that  G  has  a  Eulerian  circuit  iff  every  node  has  even  degree  (i.c.,  an  even  number  of  edges 
incident  upon  it). 
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(b)  Design  an  algorithm  of  complexity  0(\E\)  which  determines  a  Kulerian  circuit  of  G  if  there  is  one. 
PROM, KM  29: 

Give  an  example  of  a  phna r  graph  (without  self-loops  and  parallel  edges)  where  every  node  has 
degree  at  least  5. 


PllOM.F.M  30: 

(a)  Give  the  formulas  for  a  coordinate  transformation  from  the  surface  of  the  sphere  onto  the  plane 
which  transforms  the  plane  drawing  of  a  (finite)  graph  on  the  sphere  into  a  plane  drawing  in  the 
plane  with  a  prespecified  outer  face. 

(b)  Give  a  mcromorphic  transformation  of  the  (complex)  plane  (i.e.,  a  transformation  of  the  form 
f(z)  =  £„>manzn  for  some  integer  m  converging  in  some  region  of  the  plane)  which  achieves 
the  same  task  for  a  plane  graph  given  in  the  plane. 

PllOHI. KM  31: 

The  dual  graph  of  a  plane  graph  G  is  given  by  a  node  for  every  face  of  G  and  an  edge  connecting 
two  nodes  for  every  edge  of  G  which  is  on  the  common  boundary  line  of  the  two  faces  corresponding  to 
the  nodes. 

(a)  Prove  that  the  dual  of  a  plane  graph  is  planar. 

(b)  Prove  that  the  dual  of  the  dual  is  isomorphic  to  the  graph  itself. 

(c)  Prove  that  every  plane  graph  has  either  a  node  of  degree  <  2  or  a  face  with  at  most  5  edges  on  its 
boundary. 

Prom. KM  32: 

Discuss  how  to  obtain  (using  die  planarity  testing  algorithm  presented  in  class)  an  adjacency  list 
representation  of  a  planar  graph  G  where  for  each  node  the  incident  edges  appear  in  the  adjacency  list  in 
the  order  given  by  dicir  clockwise  ordering  in  some  plane  drawing  of  G  (modulo  die  starting  direction). 

Prom, km  33: 

a)  Find  an  Cl(|  V'  ||/s7|)  algoridim  to  solve  the  single  source  shortest  padi  problem  if  edges  with  negative 
weights,  but  no  negative  weight  cycles  are  allowed. 

b)  Modify  your  algorithm  so  diat  it  solves  the  single  source  shortest  path  problem  or  finds  a  negative 
weight  cycle  given  a  graph  with  possibly  negative  edge  weights. 


Prom, KM  34: 

F  ind  a  linear  time  algorithm  to  6-color  a  planar  graph.  A  graph  is  n-colorcd  if  every  node  is  assigned 
one  of  «  colors  and  no  two  nodes  connected  by  an  edge  have  the  same  color. 


Prom.km  35: 

a)  Prove  that  any  planar  graph  can  be  5-colorcd.  (Hint:  If  a  vertex  has  degree  five,  contract  it  and  two 
of  its  ncigbhors.  Use  without  proof  that  this  preserves  planarity.) 

b)  Convert  your  proof  into  an  algorithm  to  5-color  a  planar  graph.  Analyze  its  complexity  in  terms  of 
|  V  |.  the  number  of  vertices  in  the  graph. 

c)  Find  a  linear  time  algorithm  to  5-color  a  planar  graph.  (Hint:  Use  without  proof  die  fact  that  every 
planar  graph  has  a  vertex  of  degree  four  or  loss  or  it  has  a  vertex  of  degree  five  with  four  neighbors 
all  of  degree  at  most  11.) 
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Problem  36: 

Simulate  the  Hoperoft-Tarjan  planarity  testing  algorithm  on  the  following  graph.  There  are  7  vertices 
vu ...  ,v7.  There  arc  15  edges.  Kach  vertex  v{  is  connected  to  v,+m  for  i  =  —  m  and 

m  =  1,2,3.  Show  all  phases  of  the  algorithm. 

Problems  for  the  Closed  Book  I'inal: 


Problem  37: 

Investigating  odd-even  merging  the  following  recurrence  arises: 

/*  =  2*  -  1  -  /*_„  /o  =  0. 


What  is  die  solution? 

Problem  38: 

Give  die  position  tree  for  the  string  aaaabaaabaabab$. 


Problem  39: 

Prove  that  an  undirected  graph  is  biconncctcd  if  and  only  if  for  every  triple  a,  b,  c  of  nodes  there  is 
a  simple  padi  from  a  to  c  through  b. 

Problem  40: 

(a)  How  would  you  have  to  define  the  capacity  of  a  cut  in  a  network  with  (nonnegativc)  lower  bounds 
such  that  the  Min-Cut  Max-Flow  Theorem  holds? 

(b)  Give  a  detailed  proof  of  a  “Min-Flow  Max-Cut”  theorem  (see  what  function  of  a  cut  is  maximum). 
Problem  41: 

Call  the  sum  over  the  degrees  of  the  free  nodes  with  respect  to  some  matching  rn  the  deficiency 
d{M).  Give  an  efficient  algorithm  to  find  a  matching  with  minimum  deficiency.  You  may  describe  a 
reduction  to  some  problem  discussed  in  class.  What  is  the  complexity  of  the  algorithm? 

Problem  42: 

Give  a  program  for  a  version  of  Dijkstra's  shortest  path  algoridim  which,  for  some  given  node  a,  in 
addition  to  all  distances  dis[a,  6],  also  determines  the  minimal  number  of  edges  on  a  shortest  padi  from 
a  to  b.  for  every  node  b.  Establish  the  correctness  of  your  algorithm  and  its  time  complexity. 

Problem  43: 

Let  C  =  (V,  li)  be  a  digraph  with  a  (nonnegative)  distance  function  d  on  the  edges,  and  suppose 
that  we  want  to  find,  for  some  fixed  a  E  V,  the  length  of  a  second  shortest  path  from  a  to  b,  for  every 
6  €  V .  (Such  a  second  shortest  path  need  not  be  simple,  but  of  course  it  reaches  its  endnodc  just  once! 
It  may  also  have  the  same  length  as  a  shortest  path,  we  only  require  it  to  be  different  from  some  shortest 
path  and  having  minimal  length  among  all  such  paths.) 

(a)  What  is  the  length  di.s2[«]  of  a  second  shortest  path  from  a  too? 

(b)  Suppose  some  shortest  path  from  a  to  b  6  V  contains  just  one  edge.  What  is  di»2[b\? 

(c)  Give  an  algorithm  to  compute  dta‘2[fc)  for  all  b  £  V. 

(d)  Determine  die  time  complexity. 
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